/ MONGODB PARTNER STACK· Atlas AI Search

AI search & retrieval on MongoDB Atlas — vector, full-text, hybrid.

humaineeti engineers the retrieval layer for RAG and agents on MongoDB Atlas — Atlas Vector Search, Atlas Search, hybrid retrieval, and Voyage AI embeddings and reranking, on your operational data.

One database for documents and vectors. No separate vector store to run or sync. Chunking, embedding, hybrid retrieval, reranking, evaluation, and citation-grounded answers — governed, residency-aware, and built for the enterprise.

Most RAG projects stall on the retrieval layer, not the model. MongoDB Atlas puts your operational data and its vector embeddings in one database — so humaineeti builds semantic, lexical, and hybrid search on the data you already run, wires in Voyage AI embeddings and reranking, and grounds every answer in citations. No second system to provision, no sync pipeline to babysit.

One platform, operational data + vectors

Atlas AI search and retrieval unifies your documents and their vector embeddings in a single database. There is no separate vector store to stand up, operate, or keep in sync with the source of truth — which removes an entire class of dual-write and staleness bugs that plague bolt-on vector stacks.

Atlas Vector Search

Atlas Vector Search runs semantic, similarity search over vector embeddings. Rather than matching exact keywords, it finds the documents whose meaning is closest to the query — the retrieval primitive underneath modern AI search.

Hybrid search (vector + full-text)

Pure semantic search misses queries where exact terms carry the meaning — a code identifier, a policy number, a proper noun. Pure keyword search misses queries where intent matters more than the words used. Hybrid search fuses both.

Voyage AI embeddings & reranking

Voyage AI provides MongoDB's embedding and reranking models — state-of-the-art vectorisation and relevance scoring. humaineeti picks the right model per corpus and language, then layers reranking on top to sharpen the final ranking.

voyage-4-large

Best general-purpose & multilingual retrieval quality.

  • The default when retrieval accuracy is paramount.
  • The voyage-4 series shares one embedding space, so models mix cleanly.

voyage-context-3

Contextualised chunk embeddings.

  • Each chunk is embedded with its surrounding document context.
  • Higher accuracy on long, chunked documents; multilingual.

voyage-multimodal-3.5

Rich multimodal embeddings.

  • Vectorises interleaved text and visuals with one model.
  • PDF screenshots, slides, tables, figures, and video.

rerank-2.5

Generalist, instruction-following reranker.

  • Re-scores candidates by true query relevance; multilingual.
  • Works over embedding or lexical (BM25/TF-IDF) results.

The family extends further: voyage-code-3 for code retrieval with lower-dimensional quantized embeddings, domain-specialised models for finance and law, and a lighter rerank-lite-1 reranker. Quantized embeddings cut storage and latency cost; multimodal embeddings extend retrieval to images and video — including intelligent video search that pinpoints the exact moment a query refers to. (Voyage AI on MongoDB Atlas is a Preview capability.)

How humaineeti engineers RAG on Atlas

The model is the easy part. The retrieval layer is where accuracy is won or lost, and it is what humaineeti builds and evaluates end to end on Atlas.

  1. Chunking. Split source documents into retrieval units sized for the corpus and the questions users actually ask — not arbitrary fixed windows.
  2. Embedding. Choose the right Voyage AI model per corpus (general voyage-4-large, contextual voyage-context-3, code, multimodal, finance, legal), with quantized embeddings where cost and latency matter.
  3. Hybrid retrieval. Combine Atlas Vector Search and Atlas Search so both semantic and exact-term queries land the right documents.
  4. Reranking. Apply a Voyage AI reranker (rerank-2.5) over the candidate set to lift the most relevant results to the top before generation.
  5. Citation-grounded answers. Every generated answer carries its sources, so responses are traceable and auditable rather than free-floating.
  6. Evaluation. Measure retrieval and answer quality with Ragas and custom scorers via Eval@Core — on your corpus and queries, not vendor benchmarks — so changes are proven, not assumed.

Below, the shape of a retrieval-then-generate loop — conceptual, engine-agnostic on the model side:

# pseudocode — retrieval on Atlas, model-agnostic generation
q_vec   = voyage.embed(query)                 # Voyage AI embedding
sem     = atlas.vector_search(q_vec, k=50)    # Atlas Vector Search
lex     = atlas.text_search(query, k=50)      # Atlas Search (BM25)
cands   = fuse(sem, lex)                       # hybrid retrieval
top     = voyage.rerank(query, cands, k=8)    # Voyage AI reranker (rerank-2.5)
answer  = llm.generate(query, context=top)    # cite every source

Governance, residency & evaluation

Retrieval touches your most sensitive data, so humaineeti builds the governance in from the start — especially for India-based and regulated workloads.

The stack

A single, coherent retrieval stack humaineeti stands up on your operational data:

Frequently asked questions

What is MongoDB Atlas Vector Search?

Semantic, similarity search over vector embeddings stored alongside your operational data. It finds the documents whose meaning is closest to a query — the retrieval layer that powers RAG, agentic retrieval, and semantic search. Because vectors live in the same database as your documents, there is no separate vector store to run or sync.

What is hybrid search?

Hybrid search combines Atlas Vector Search (semantic) with Atlas Search (Lucene-based full-text, BM25 lexical matching) and fuses the results. It catches queries where meaning matters and queries where exact terms matter. On most enterprise corpora it is more relevant than either approach alone.

What is Voyage AI?

MongoDB's embedding and reranking models. Featured embeddings include voyage-4-large (general-purpose, multilingual), voyage-context-3 (contextualised chunks), and voyage-multimodal-3.5 (text + visuals, incl. video); voyage-code-3 and finance/legal models extend the family. Rerankers rerank-2.5 (generalist, instruction-following) and the lighter rerank-lite-1 re-score query-document relevance over embedding or lexical (BM25/TF-IDF) results.

Do I need a separate vector database?

Not with Atlas. It unifies operational data and vector embeddings in one database, so there is no separate vector store to provision, operate, or synchronise. That removes dual-write and staleness bugs and simplifies governance, backup, and access control.

Related from humaineeti

Ask for a free demo