FerresDB delivers sub-millisecond vector search with hybrid BM25 retrieval, native Cross-Encoder re-ranking, graph exploration, advanced quantization (SQ8 + QJL + PolarQuant), gRPC streaming, tiered storage, Point-in-Time Recovery, and enterprise-grade RBAC — all powered by Rust for uncompromising performance in RAG, semantic search, knowledge graphs, and recommendation systems.
From RAG pipelines to real-time recommendations, FerresDB powers the most demanding vector workloads.
Transform user queries into meaning-based results with Cosine, Euclidean, or Dot Product similarity. Combine with metadata filters for precision retrieval.
Hybrid vector + BM25 search with weighted or RRF fusion, plus native Cross-Encoder re-ranking via ONNX. Ground your LLM responses with the most relevant context.
Real-time similarity matching with WebSocket streaming. Dot Product distance optimized for recommendation models. Auto-batching up to 1000 points/request.
Store graph edges alongside vectors. Traverse connected points via BFS, query subgraphs, and combine vector similarity with graph proximity for richer results.
A complete vector database with enterprise-grade features, built from the ground up in Rust for maximum performance and reliability.
P50 search at 100–500μs, P95 at 200–1000μs. No GC pauses — Rust delivers predictable, low-latency execution with zero runtime overhead.
Combine dense vector search with BM25 text retrieval using weighted fusion or Reciprocal Rank Fusion (RRF). Tunable alpha parameter for precision control.
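Reciprocal Rank Fusion merges two ranked lists by summing 1 / (k + rank) per document across lists. A minimal generic sketch of the formula (not FerresDB's internal code; the constant k = 60 is the commonly used default, assumed here):

```typescript
// RRF: each list contributes 1 / (k + rank) for every document it
// contains; documents ranked well in both lists float to the top.
// k dampens the influence of the very top ranks.
function rrfFuse(vectorIds: string[], bm25Ids: string[], k = 60): string[] {
  const scores = new Map<string, number>();
  for (const list of [vectorIds, bm25Ids]) {
    list.forEach((id, rank) => {
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank + 1));
    });
  }
  // Sort by fused score, descending
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}
```

A document appearing in both lists beats one appearing in only a single list, which is why RRF needs no score normalization between BM25 and cosine scores.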
REST API for simplicity, gRPC with bidirectional streaming for high-throughput, WebSocket for real-time applications, and MCP (Model Context Protocol) via STDIO for Claude Desktop and other AI assistants. All protocols run in parallel.
Automatically move vectors between RAM (Hot), memory-mapped (Warm), and disk (Cold) tiers based on access frequency. HNSW graph stays in memory for speed.
Role-based access control with Admin, Editor, and Viewer roles. Granular per-collection permissions with metadata restrictions. Daily-rotated audit logs.
Rebuild HNSW indexes in the background. Searches continue on the old index until the new one is ready. Auto-triggers when tombstones exceed 20%.
Write-Ahead Log with periodic snapshots every 1000 ops. Automatic crash recovery replays the WAL from the latest snapshot. Auto-save every 30 seconds.
Prometheus metrics endpoint, query profiling with /search/explain, slow query tracking, cost estimation with budget_ms, and a built-in web dashboard.
Fully-typed TypeScript SDK with Zod validation and WebSocket support. Async Python SDK with httpx. Both feature auto-retry, auto-batching, and structured logging.
Native point-level graph: store relations between documents, traverse subgraphs via BFS, and combine graph proximity with vector similarity. Ideal for knowledge graphs and connected recommendations.
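The traversal half of this feature is an ordinary breadth-first search over per-point edges. A self-contained sketch of that idea, assuming edges stored as an adjacency list (this is an illustration, not the server's implementation):

```typescript
// Breadth-first traversal over point-level edges, bounded by a hop
// limit: the same shape of walk a subgraph query performs server-side.
function bfs(
  edges: Map<string, string[]>,
  start: string,
  maxHops: number
): Set<string> {
  const visited = new Set<string>([start]);
  let frontier = [start];
  for (let hop = 0; hop < maxHops && frontier.length > 0; hop++) {
    const next: string[] = [];
    for (const node of frontier) {
      for (const neighbor of edges.get(node) ?? []) {
        if (!visited.has(neighbor)) {
          visited.add(neighbor);
          next.push(neighbor);
        }
      }
    }
    frontier = next;
  }
  return visited;
}
```

Combining graph proximity with vector similarity then amounts to restricting (or boosting) search candidates to the set this traversal returns.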
SQ8 scalar quantization (4× memory reduction), QJL residual correction to reduce quantization bias and improve recall@10, and PolarQuant — a calibration-free polar-coordinate encoding.
Dynamically adjusts ef_search every 60 seconds based on real-time P95 latency and CPU load. Increases recall when latency budget permits; backs off under pressure. Zero config required.
Every layer of FerresDB is designed for performance, safety, and operational excellence.
REST, gRPC (port 50051), WebSocket — all running in parallel
RBAC with Admin/Editor/Viewer roles, per-collection permissions
Cosine/Euclidean/DotProduct metrics, metadata filters, hybrid fusion
Hot (RAM) / Warm (mmap) / Cold (disk), auto-save every 30s
Metrics, query profiling, slow queries, daily audit trail (JSONL)
The Hierarchical Navigable Small World index is tuned for an optimal balance of speed and recall.
Tunable parameters: `m` (max connections per layer), `ef_construction` (index build quality), `ef_search` (query search width).

Benchmarked with Criterion.rs — real numbers, not marketing claims.
4× memory reduction with SQ8. QJL residual correction maintains recall without extra RAM.
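Scalar quantization to 8 bits maps each float component onto 256 levels between a min and a max, so 4 bytes per dimension become 1. A simplified per-vector sketch; real implementations often calibrate the range per dimension or per collection, and those details are assumptions here:

```typescript
// SQ8: encode each f32 component as a u8 level in [0, 255] using the
// vector's min/max range (4x memory reduction per dimension).
function sq8Encode(v: number[]): { codes: Uint8Array; min: number; scale: number } {
  const min = Math.min(...v);
  const max = Math.max(...v);
  const scale = (max - min) / 255 || 1; // guard against a constant vector
  const codes = new Uint8Array(v.map((x) => Math.round((x - min) / scale)));
  return { codes, min, scale };
}

function sq8Decode(q: { codes: Uint8Array; min: number; scale: number }): number[] {
  // Reconstruction error per component is at most scale / 2; a residual
  // correction such as QJL targets exactly this remaining bias.
  return [...q.codes].map((c) => q.min + c * q.scale);
}
```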
No GC pauses, zero-cost abstractions, memory safety without runtime overhead. Compiled to native machine code.
Multi-layer graph with O(log N) search complexity. Optimized for high recall with configurable ef_search.
Thread-safe design with parallelized batch operations. Ready for multi-threaded servers and concurrent requests.
Optional caching for repeated queries. On startup, the server replays recent queries from the query log to warm the index and cache automatically.
Distance kernels use AVX2 (8× f32) and SSE4.1 (4× f32) with runtime dispatch. Asymmetric SQ8 distance (f32×u8) also SIMD-accelerated.
ef_search is adjusted every 60 s using P95 latency as a proxy. Automatically increases recall when bandwidth is available, backs off under load.
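The control loop can be sketched as a simple proportional adjuster. Only the 60 s cadence and the P95 signal come from the text above; the step sizes and bounds below are illustrative assumptions:

```typescript
// Adaptive ef_search: on each tick (every 60 s in FerresDB), compare
// observed P95 latency against the budget and nudge ef_search up
// (more recall) or down (less work). Bounds and steps are illustrative.
function nextEfSearch(current: number, p95Us: number, budgetUs: number): number {
  const MIN_EF = 16;
  const MAX_EF = 512;
  if (p95Us > budgetUs) {
    // Over budget: back off quickly to protect latency
    return Math.max(MIN_EF, Math.floor(current * 0.8));
  }
  if (p95Us < budgetUs * 0.5) {
    // Plenty of headroom: raise recall gently
    return Math.min(MAX_EF, current + 16);
  }
  return current; // within band: hold steady
}
```

Backing off multiplicatively while increasing additively keeps the loop stable, in the same spirit as AIMD congestion control.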
See how FerresDB compares to conventional vector databases
| Aspect | FerresDB | Others |
|---|---|---|
| Language | Pure Rust — zero GC, native performance | Python, Go, or Java with GC overhead |
| Search Latency | P50: 100–500μs (sub-millisecond) | Typically 1–50ms per query |
| Search Types | Vector + BM25 hybrid (weighted & RRF fusion) | Often vector-only focus |
| Protocols | REST + gRPC (streaming) + WebSocket | Usually REST or gRPC only |
| Storage | WAL + snapshots + tiered (Hot/Warm/Cold) | Not all offer WAL + crash recovery |
| Security | RBAC + API Keys + JWT + Audit Trail | Varies — often basic API keys only |
| Deployment | Single Docker container, no cloud lock-in | Many are managed-only or heavier |
| Observability | Prometheus + query profiling + dashboard | Depends on the product |
| Quantization | SQ8 (4× mem) + QJL residual + PolarQuant — all opt-in | Rarely built-in; often external preprocessing |
| Graph Support | Native point graph with BFS traversal + subgraph API | Not a standard feature |
| Disaster Recovery | WAL + Snapshots + PITR (restore to any past timestamp) | Basic snapshots at best |
Deploy the full stack with Docker Compose or run individual containers
Recommended — runs Backend + Dashboard together
# 1. Pull both images
docker pull ferresdb/ferres-db-core:latest
docker pull ferresdb/ferres-db-frontend:latest
# 2. Run the backend
docker run -d -p 8080:8080 \
-e FERRESDB_API_KEYS=sk-your-key \
-e CORS_ORIGINS=http://localhost:3000 \
-v ferres-data:/data \
ferresdb/ferres-db-core:latest
# 3. Run the dashboard
docker run -d -p 3000:80 \
-e VITE_API_BASE_URL=http://localhost:8080 \
-e VITE_API_KEY=sk-your-key \
ferresdb/ferres-db-frontend:latest

pnpm add @ferresdb/typescript-sdk

pip install ferres-db-python

From zero to vector search in under 10 lines of code
import { VectorDBClient, DistanceMetric } from "@ferresdb/typescript-sdk";
// Initialize client with auto-retry and timeout
const client = new VectorDBClient({
baseUrl: "http://localhost:8080",
apiKey: "ferres_sk_...",
maxRetries: 3,
});
// Create a collection with hybrid search enabled
await client.createCollection({
name: "documents",
dimension: 384,
distance: DistanceMetric.Cosine,
enable_bm25: true,
});
// Upsert vectors with metadata (auto-batches > 1000)
await client.upsertPoints("documents", [
{ id: "doc-1", vector: [0.1, 0.2, ...], metadata: { text: "Hello" } },
]);
// Hybrid search: vector + BM25 with weighted fusion
const results = await client.hybridSearch("documents", {
query_text: "how to deploy",
query_vector: [0.1, 0.2, ...],
limit: 5,
alpha: 0.5, // 0 = BM25 only, 1 = vector only
});

Security, compliance, and operational features built-in — not bolted on.
API Keys (SHA-256 hashed, stored in SQLite) for programmatic access. JWT tokens (Argon2 passwords) for dashboard sessions.
Admin, Editor, Viewer roles with per-collection permissions. Restrict access to specific metadata fields and allowed values.
Every action logged: searches, mutations, logins, user management. Daily-rotated JSONL files with user, IP, duration, and result.
Use /search/explain to understand query execution. /search/estimate for cost prediction. Slow query tracking for optimization.
Rich filter operators: $eq, $ne, $in, $gt, $lt, $gte, $lte. Combine with vector search for precise, scoped retrieval.
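The operator semantics can be made concrete with a tiny matcher. This is a sketch of how such a filter evaluates against one point's metadata, not the server's code:

```typescript
// Evaluate a filter like { price: { $gt: 10 }, tag: { $in: [...] } }
// against a single point's metadata, covering the operators above.
type Ops = {
  $eq?: unknown; $ne?: unknown; $in?: unknown[];
  $gt?: number; $lt?: number; $gte?: number; $lte?: number;
};

function matches(meta: Record<string, unknown>, filter: Record<string, Ops>): boolean {
  // Every field condition must hold (implicit AND across fields)
  return Object.entries(filter).every(([field, ops]) => {
    const v = meta[field];
    if ("$eq" in ops && v !== ops.$eq) return false;
    if ("$ne" in ops && v === ops.$ne) return false;
    if (ops.$in && !ops.$in.includes(v)) return false;
    if (ops.$gt !== undefined && !(typeof v === "number" && v > ops.$gt)) return false;
    if (ops.$lt !== undefined && !(typeof v === "number" && v < ops.$lt)) return false;
    if (ops.$gte !== undefined && !(typeof v === "number" && v >= ops.$gte)) return false;
    if (ops.$lte !== undefined && !(typeof v === "number" && v <= ops.$lte)) return false;
    return true;
  });
}
```

In a real query the same filter object is passed alongside the vector, so only matching points are scored.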
Set budget_ms on any search query. Automatically fails with 422 if the latency budget is exceeded — perfect for SLA enforcement.
Every WAL entry is timestamped. Restore any collection (or all) to an exact past moment via POST /admin/restore. Browse available timestamps before committing.
One-click snapshot export to AWS S3 (or any S3-compatible endpoint). Configure region, bucket, and credentials via config.toml or environment variables.
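An illustrative config.toml fragment for that section; the exact key names are assumptions and should be checked against the FerresDB docs:

```toml
# Hypothetical S3 snapshot-export section: key names are assumed,
# shown only to make the shape concrete.
[s3]
region = "eu-west-1"
bucket = "ferresdb-snapshots"
endpoint = "https://s3.eu-west-1.amazonaws.com"  # any S3-compatible endpoint
# Credentials may also be supplied via environment variables instead
access_key = "AKIA..."
secret_key = "..."
```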
Foundation for multi-node deployments: optional openraft integration replicates WAL to a majority of nodes before confirming writes. Build with --features raft.
With namespace_physical_isolation, each tenant's data lives in a separate directory. Enables per-namespace snapshots and clean tenant offboarding without affecting others.
Join developers building the next generation of AI applications with FerresDB. Self-hosted, no cloud lock-in.