MDDB Roadmap
Detailed roadmap showing implemented features and future plans.
Implemented Features
v1.0โv2.3 โ Core Database
- Document CRUD with metadata, collections, multi-language
- Revision history with full snapshots
- Template variables, bulk import
- gRPC/Protobuf API (16x faster than HTTP)
- Document TTL with background cleanup
- Vector search (OpenAI, Ollama embeddings)
- Webhooks with retry logic
- Full-text search (TF-IDF), import from URL
- MCP server (stdio + HTTP), Prometheus telemetry
- Schema validation, custom YAML-based MCP tools
v2.4 โ Security
- JWT authentication, API keys, bcrypt
- Collection-level RBAC (read/write/admin)
- User/group management with inherited permissions
v2.5 โ GraphQL
- GraphQL API with Playground, authentication directives
- CLI GraphQL support, Web Panel toggle
- Cohere and Voyage AI embedding providers
v2.6โv2.7 โ Search & Replication
- Advanced FTS: BM25, BM25F, PMISparse, 7 search modes, 18 languages, fuzzy, synonyms
- Hybrid search (BM25 + vector, alpha blending, RRF)
- Vector algorithms: Flat, HNSW, IVF, PQ, SQ, BQ
- Zero-shot classification
- Leader-follower replication with binlog streaming
- Aggregations (facets, histograms)
v2.8 โ Storage & Multi-Format
- Per-collection storage backends (BoltDB, memory, S3/MinIO)
- File upload (PDF, DOCX, HTML, ODT, RTF, TEX, YAML โ Markdown)
- Automation system (triggers, crons, webhooks with templates)
- Cross-collection search, duplicate detection
v2.9 โ Quantization & Infrastructure (current)
- Per-collection vector quantization (int8 = 4x, int4 = 8x compression)
- Server-Sent Events (SSE) for real-time document change notifications
- pprof profiling endpoints
- HTTP connection pooling (shared transport)
- Built-in TLS/HTTPS support
v2.9.4 โ MCP 2025-11-25 Compliance
- Protocol version:
2025-11-25(upgraded from 2024-11-05) - Streamable HTTP transport (
POST/GET /mcp) โ new standard, alongside legacy SSE - Tool annotations:
readOnlyHint,destructiveHint,idempotentHint,openWorldHinton all 52+ tools - Structured output schemas (
outputSchema) on key tools (stats, search, classification, aggregation) - Prompts: 5 built-in prompts (analyze-collection, search-help, summarize-collection, import-guide, rag-pipeline)
- Completion/autocomplete: collection names, prompt arguments, search algorithms
- MCP logging:
logging/setLevel+notifications/message(RFC 5424 levels) - Notifications:
notifications/initialized,notifications/cancelled,tools/list_changed,resources/list_changed - Progress tokens:
notifications/progressinfrastructure for long-running tools (reindex, ingest, backup) - Cursor-based pagination in
tools/listandresources/list - Error handling: tool errors return
isError: trueinstead of JSON-RPC error (per spec)
Planned Features
v2.10 โ MCP Advanced & Observability (Q2 2026)
MCP Advanced Features
- โณ OAuth 2.1 authorization for HTTP transport (PKCE, resource indicators)
- โณ Sampling โ server-initiated LLM requests (auto-classification, content summarization)
- โณ Elicitation โ server prompts user for input (form and URL modes)
- โณ Resource subscriptions (
resources/subscribe,notifications/resources/updated) - โณ Tasks โ durable state machines for long-running operations (experimental spec)
- โณ Icons on tools, resources, prompts, and serverInfo
- โณ Roots support (client workspace awareness)
- โณ JSON-RPC batch requests
Observability
- โณ OpenTelemetry / distributed tracing
- โณ Slow query logging (threshold-based)
- โณ Structured JSON logging with configurable levels
Security
- โณ Encryption at rest (AES-256-GCM on BoltDB values)
- โณ Comprehensive audit log (who/what/when)
- โณ Field-level encryption for sensitive metadata
- โณ Key rotation mechanism
Backup & Recovery
- โณ Incremental backups (binlog-based)
- โณ Point-in-time recovery (PITR)
- โณ Scheduled auto-backup (cron + S3/GCS destination)
v2.11 โ Search & AI (Q3 2026)
Geosearch โ delivered in v2.9.10
- โ Postcode/GPS-based distance search (moved to v2.9.10)
- โ Geo-bounding box queries (moved to v2.9.10)
- โ Geospatial index โ R-tree + geohash (moved to v2.9.10)
Advanced Vector Search
- โณ Cross-encoder re-ranking (two-stage retrieval)
- โณ Sparse-dense hybrid vectors (SPLADE/ColBERT)
- โณ Multi-vector documents
- โณ Streaming embeddings (embed during upload)
v3.0 โ Extensibility (Q4 2026)
Plugin System
- โณ Go plugin architecture
- โณ Custom storage backends
- โณ Custom embedding providers
- โณ Custom authentication providers
- โณ Custom search algorithms
Event Streaming
- โณ Kafka integration
- โณ NATS integration
- โณ Redis Streams support
- โณ Change Data Capture (CDC)
v3.1 โ Real-Time & GraphQL (2027)
GraphQL Subscriptions
- โณ Real-time updates via WebSocket
- โณ Document change subscriptions
- โณ Filtered subscriptions by collection/metadata
GraphQL Federation
- โณ Apollo Federation support
- โณ Subgraph schema, reference resolution
v3.2 โ Clustering & HA (2027)
Distributed Consensus
- โณ Raft-based consensus
- โณ Automatic leader election & failover
- โณ Split-brain prevention
- โณ Quorum-based writes
Sharding
- โณ Horizontal sharding by collection/key hash
- โณ Cross-shard queries
- โณ Automatic shard rebalancing
v3.3 โ Multi-Tenancy (2027)
- โณ Tenant-level data isolation
- โณ Per-tenant quotas (storage, requests, bandwidth)
- โณ Per-tenant rate limiting
- โณ Tenant provisioning API
- โณ Cross-tenant admin queries
v3.4 โ External Cache (2027)
- โณ Redis integration for distributed cache
- โณ Memcached support
- โณ Cache invalidation webhooks
- โณ Query result caching with automatic invalidation
- โณ Cache analytics and adaptive sizing
Under Consideration
- ๐ Advanced analytics dashboard
- ๐ Document relationships/links (graph queries)
- ๐ Automatic cloud backups (S3/GCS/Azure scheduled)
- ๐ GUI for schema validation rules
- ๐ Built-in image optimization
- ๐ Markdown linting and validation
- ๐ Multi-region deployment
- ๐ WebAssembly plugin support
- ๐ OpenAPI v3.1 spec auto-generation
Feedback & Suggestions
- Feature Requests:GitHub Discussions
- Bug Reports:GitHub Issues
- Contribute: See CONTRIBUTING.md