MDDB Roadmap

Detailed roadmap showing implemented features and future plans.

Implemented Features

v1.0โ€“v2.3 โ€” Core Database

  • Document CRUD with metadata, collections, multi-language
  • Revision history with full snapshots
  • Template variables, bulk import
  • gRPC/Protobuf API (16x faster than HTTP)
  • Document TTL with background cleanup
  • Vector search (OpenAI, Ollama embeddings)
  • Webhooks with retry logic
  • Full-text search (TF-IDF), import from URL
  • MCP server (stdio + HTTP), Prometheus telemetry
  • Schema validation, custom YAML-based MCP tools

v2.4 โ€” Security

  • JWT authentication, API keys, bcrypt
  • Collection-level RBAC (read/write/admin)
  • User/group management with inherited permissions

v2.5 โ€” GraphQL

  • GraphQL API with Playground, authentication directives
  • CLI GraphQL support, Web Panel toggle
  • Cohere and Voyage AI embedding providers

v2.6โ€“v2.7 โ€” Search & Replication

  • Advanced FTS: BM25, BM25F, PMISparse, 7 search modes, 18 languages, fuzzy, synonyms
  • Hybrid search (BM25 + vector, alpha blending, RRF)
  • Vector algorithms: Flat, HNSW, IVF, PQ, SQ, BQ
  • Zero-shot classification
  • Leader-follower replication with binlog streaming
  • Aggregations (facets, histograms)

v2.8 โ€” Storage & Multi-Format

  • Per-collection storage backends (BoltDB, memory, S3/MinIO)
  • File upload (PDF, DOCX, HTML, ODT, RTF, TEX, YAML โ†’ Markdown)
  • Automation system (triggers, crons, webhooks with templates)
  • Cross-collection search, duplicate detection

v2.9 โ€” Quantization & Infrastructure (current)

  • Per-collection vector quantization (int8 = 4x, int4 = 8x compression)
  • Server-Sent Events (SSE) for real-time document change notifications
  • pprof profiling endpoints
  • HTTP connection pooling (shared transport)
  • Built-in TLS/HTTPS support

v2.9.4 โ€” MCP 2025-11-25 Compliance

  • Protocol version: 2025-11-25 (upgraded from 2024-11-05)
  • Streamable HTTP transport (POST/GET /mcp) โ€” new standard, alongside legacy SSE
  • Tool annotations: readOnlyHint, destructiveHint, idempotentHint, openWorldHint on all 52+ tools
  • Structured output schemas (outputSchema) on key tools (stats, search, classification, aggregation)
  • Prompts: 5 built-in prompts (analyze-collection, search-help, summarize-collection, import-guide, rag-pipeline)
  • Completion/autocomplete: collection names, prompt arguments, search algorithms
  • MCP logging: logging/setLevel + notifications/message (RFC 5424 levels)
  • Notifications: notifications/initialized, notifications/cancelled, tools/list_changed, resources/list_changed
  • Progress tokens: notifications/progress infrastructure for long-running tools (reindex, ingest, backup)
  • Cursor-based pagination in tools/list and resources/list
  • Error handling: tool errors return isError: true instead of JSON-RPC error (per spec)

Planned Features

v2.10 โ€” MCP Advanced & Observability (Q2 2026)

MCP Advanced Features

  • โณ OAuth 2.1 authorization for HTTP transport (PKCE, resource indicators)
  • โณ Sampling โ€” server-initiated LLM requests (auto-classification, content summarization)
  • โณ Elicitation โ€” server prompts user for input (form and URL modes)
  • โณ Resource subscriptions (resources/subscribe, notifications/resources/updated)
  • โณ Tasks โ€” durable state machines for long-running operations (experimental spec)
  • โณ Icons on tools, resources, prompts, and serverInfo
  • โณ Roots support (client workspace awareness)
  • โณ JSON-RPC batch requests

Observability

  • โณ OpenTelemetry / distributed tracing
  • โณ Slow query logging (threshold-based)
  • โณ Structured JSON logging with configurable levels

Security

  • โณ Encryption at rest (AES-256-GCM on BoltDB values)
  • โณ Comprehensive audit log (who/what/when)
  • โณ Field-level encryption for sensitive metadata
  • โณ Key rotation mechanism

Backup & Recovery

  • โณ Incremental backups (binlog-based)
  • โณ Point-in-time recovery (PITR)
  • โณ Scheduled auto-backup (cron + S3/GCS destination)

v2.11 โ€” Search & AI (Q3 2026)

Geosearch โ€” delivered in v2.9.10

  • โœ… Postcode/GPS-based distance search (moved to v2.9.10)
  • โœ… Geo-bounding box queries (moved to v2.9.10)
  • โœ… Geospatial index โ€” R-tree + geohash (moved to v2.9.10)

Advanced Vector Search

  • โณ Cross-encoder re-ranking (two-stage retrieval)
  • โณ Sparse-dense hybrid vectors (SPLADE/ColBERT)
  • โณ Multi-vector documents
  • โณ Streaming embeddings (embed during upload)

v3.0 โ€” Extensibility (Q4 2026)

Plugin System

  • โณ Go plugin architecture
  • โณ Custom storage backends
  • โณ Custom embedding providers
  • โณ Custom authentication providers
  • โณ Custom search algorithms

Event Streaming

  • โณ Kafka integration
  • โณ NATS integration
  • โณ Redis Streams support
  • โณ Change Data Capture (CDC)

v3.1 โ€” Real-Time & GraphQL (2027)

GraphQL Subscriptions

  • โณ Real-time updates via WebSocket
  • โณ Document change subscriptions
  • โณ Filtered subscriptions by collection/metadata

GraphQL Federation

  • โณ Apollo Federation support
  • โณ Subgraph schema, reference resolution

v3.2 โ€” Clustering & HA (2027)

Distributed Consensus

  • โณ Raft-based consensus
  • โณ Automatic leader election & failover
  • โณ Split-brain prevention
  • โณ Quorum-based writes

Sharding

  • โณ Horizontal sharding by collection/key hash
  • โณ Cross-shard queries
  • โณ Automatic shard rebalancing

v3.3 โ€” Multi-Tenancy (2027)

  • โณ Tenant-level data isolation
  • โณ Per-tenant quotas (storage, requests, bandwidth)
  • โณ Per-tenant rate limiting
  • โณ Tenant provisioning API
  • โณ Cross-tenant admin queries

v3.4 โ€” External Cache (2027)

  • โณ Redis integration for distributed cache
  • โณ Memcached support
  • โณ Cache invalidation webhooks
  • โณ Query result caching with automatic invalidation
  • โณ Cache analytics and adaptive sizing

Under Consideration

  • ๐Ÿ“‹ Advanced analytics dashboard
  • ๐Ÿ“‹ Document relationships/links (graph queries)
  • ๐Ÿ“‹ Automatic cloud backups (S3/GCS/Azure scheduled)
  • ๐Ÿ“‹ GUI for schema validation rules
  • ๐Ÿ“‹ Built-in image optimization
  • ๐Ÿ“‹ Markdown linting and validation
  • ๐Ÿ“‹ Multi-region deployment
  • ๐Ÿ“‹ WebAssembly plugin support
  • ๐Ÿ“‹ OpenAPI v3.1 spec auto-generation

Feedback & Suggestions


โ† Back to README | See changelog โ†’