Qdrant

Type: full-code · Vendor: Qdrant · Language: Rust · License: Apache-2.0 · Status: active · Status in practice: mature · First released: 2021-05-01

Links: homepage docs repo

Qdrant is a vector database that stores embeddings as points and retrieves the most similar ones at query time, with payload-based partitioning so one shared instance can isolate each user's or tenant's vectors.

Description. Qdrant is an open-source vector search engine written in Rust. It stores vectors (embeddings) together with JSON payloads and returns the nearest neighbours for a query vector, which is how agents and RAG pipelines retrieve semantically similar memories or documents. Search can be filtered by payload conditions and combined across dense and sparse vectors. For multi-user deployments, Qdrant supports payload-based partitioning within a single collection so each tenant can only access their own vectors.

Agent loop shape. Qdrant is the retrieval backend an agent calls, not an agent loop. Documents or memories are embedded and upserted as points with payloads; at query time the agent embeds its query and asks Qdrant for the top-k nearest points, optionally constrained by a payload filter. In multi-tenant use the query carries a tenant identifier filter so the search only ranges over that tenant's vectors, and the returned items are fed back into the agent's context.
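The retrieval step described above can be sketched with a toy in-memory store. This is not Qdrant's client API: `ToyVectorStore`, its `upsert` and `search` methods, and the `tenant_id` payload key are hypothetical names chosen for illustration, and a brute-force cosine scan stands in for Qdrant's index. The sketch only shows the shape of the loop: upsert points with payloads, search with a tenant filter, then hand the hits back as context.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ToyVectorStore:
    """Hypothetical stand-in for a collection: id -> (vector, payload)."""

    def __init__(self):
        self.points = {}

    def upsert(self, point_id, vector, payload):
        # Insert a new point or overwrite the one with the same id.
        self.points[point_id] = (vector, payload)

    def search(self, query_vector, must, limit):
        # Keep only points whose payload matches every `must` condition,
        # then rank the survivors by similarity to the query vector.
        candidates = [
            (cosine(query_vector, vec), pid, payload)
            for pid, (vec, payload) in self.points.items()
            if all(payload.get(key) == value for key, value in must.items())
        ]
        candidates.sort(key=lambda c: c[0], reverse=True)
        return candidates[:limit]


store = ToyVectorStore()
store.upsert(1, [1.0, 0.0], {"tenant_id": "alice", "text": "likes tea"})
store.upsert(2, [0.9, 0.1], {"tenant_id": "bob", "text": "likes coffee"})
store.upsert(3, [0.0, 1.0], {"tenant_id": "alice", "text": "lives in Oslo"})

# The agent embeds its query (here a hard-coded vector) and searches only
# within one tenant's points; the payload texts become its context.
hits = store.search([1.0, 0.0], must={"tenant_id": "alice"}, limit=2)
context = [payload["text"] for _, _, payload in hits]
```

Bob's point is close to the query vector but never appears in `hits`, because the tenant condition is applied before ranking rather than after.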

Primary use cases

storing embeddings for semantic retrieval
nearest-neighbour vector search for RAG and agent memory
payload-filtered and hybrid search
multi-tenant vector isolation

flowchart TD
    fw["Qdrant"]
    fw --> p1["Vector Memory<br/>(core)"]
    fw --> p2["Hybrid Search<br/>(first-class)"]
    fw --> p3["Tenant-Scoped Tool Binding<br/>(supported)"]
    fw --> p4["Cross-Encoder Reranking<br/>(supported)"]

Key concepts

Point → vector-memory (docs) — Qdrant's stored unit: a vector (embedding) together with an optional JSON payload and id, which is what nearest-neighbour search ranks and returns.
Payload filter (docs) — Conditions (must / should / must_not) on payload fields applied alongside vector similarity so search ranges only over points matching the metadata constraints.
Hybrid query → hybrid-search (docs) — A query combining dense and sparse vector results, fused via Reciprocal Rank Fusion or Distribution-Based Score Fusion, to get both semantic and exact-keyword relevance.
Prefetch → cross-encoder-reranking (docs) — A nested sub-query whose results the outer query re-scores, enabling multi-stage retrieval where a cheap vector representation fetches candidates and a richer representation reranks them.
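
To make the hybrid-query idea concrete, here is a minimal sketch of Reciprocal Rank Fusion over two candidate lists, such as one from a dense prefetch and one from a sparse prefetch. It illustrates the general technique, not Qdrant's internal implementation: the function name `rrf_fuse`, the constant `k=60` (the value conventionally used for RRF) and the 1-based ranks are assumptions made for this example.

```python
def rrf_fuse(rankings, k=60, limit=10):
    """Fuse several ranked id lists with Reciprocal Rank Fusion.

    Each list contributes 1 / (k + rank) to every id it contains, so ids
    ranked highly by several lists rise to the top of the fused order.
    """
    scores = {}
    for ranking in rankings:
        for rank, point_id in enumerate(ranking, start=1):
            scores[point_id] = scores.get(point_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first, truncated to the requested size.
    return sorted(scores, key=lambda pid: scores[pid], reverse=True)[:limit]


# Hypothetical candidate ids returned by two sub-queries.
dense_hits = ["a", "b", "c"]   # ranked by semantic similarity
sparse_hits = ["c", "a", "d"]  # ranked by keyword match

fused = rrf_fuse([dense_hits, sparse_hits], limit=3)
```

RRF uses only rank positions, so the dense and sparse scores never need to be on a comparable scale. Points "a" and "c" appear in both lists and therefore outrank "b" and "d", which each appear in one.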