Cosdata Roadmap
Detailed timeline and feature status for our journey from MVP to production-ready enterprise solution
Navigation
Timeline
Feature Status
CompletedFeatures we've finished
In ProgressCurrently working on
TodoComing next
Development Timeline
December 15, 2024
MVP/Alpha: Vector Database
- Optimized HNSW (dense) and Inverted index (sparse) implementations
- RESTful API for core operations
- SIMD optimized major distance metrics and quantization
- Versioning & "Transaction-as-a-resource"
- Run basic comparison benchmarks for HNSW & Inverted index (SPLADE)
January 30, 2025
MVP/Alpha: Graph Database
- Basic graph data structures and operations
- Simple integration with vector search
- Rudimentary CosQL for graph queries
March 15, 2025
Beta: Graph Database
- Advanced graph algorithms and knowledge graph features
- Enhanced CosQL with graph-specific operations
- Basic rule evaluation engine
June 15, 2025
RC/GA: Graph Database
- Full graph database capabilities
- Seamless integration of graph and vector search
- Advanced knowledge graph operations and querying
June 15, 2025
Beta: Cloud Services
- Multi-cloud support and improved resource management
- Enhanced monitoring and basic serverless functions
- Development of comprehensive web application
- Initial integration with major cloud ecosystems
August 30, 2025
RC/GA: Cloud Services
- Fully automated deployment and scaling
- Production-ready web application with full feature set
- Comprehensive management and analytics interface
- High availability and redundancy features
- Complete integration with major cloud ecosystems
Feature Status
Indexing and Search
| Feature | Status | Phase |
|---|---|---|
| HNSW indexing for dense vectors with high dimensionality support | Completed | MVP/ALPHA |
| Inverted Index for sparse vectors (Splade & BM25), supporting very high dimensionality | Completed | MVP/ALPHA |
| ANN probabilistic search for Inverted Index | Completed | MVP/ALPHA |
| Benchmarking Inverted Index against proprietary data type offerings | In Progress | BETA |
| Optimized hybrid search algorithms | Todo | MVP/ALPHA |
| Advanced indexing optimizations | Todo | BETA |
| Complete end-to-end comparison benchmarking of HNSW & Inverted Index | Todo | BETA |
| Implement re-ranker integration | Todo | RC/GA |
Distance Metrics and Quantization
| Feature | Status | Phase |
|---|---|---|
| Dot product | Completed | MVP/ALPHA |
| Cosine Similarity | Completed | MVP/ALPHA |
| Euclidean | Completed | MVP/ALPHA |
| Hamming | Todo | MVP/ALPHA |
| SIMD optimizations for cosine & dot product metrics | Completed | MVP/ALPHA |
| Binary (base 2) quantization | Completed | MVP/ALPHA |
| Quaternary (base 4) quantization | Completed | MVP/ALPHA |
| Octal (base 8) quantization | Completed | MVP/ALPHA |
| U8 (base 256) quantization | Completed | MVP/ALPHA |
| Sub-Byte Quantization of Inverted Index | In Progress | BETA |
| SIMD optimizations for all quantization methods | In Progress | RC/GA |
| Implementing auto-configuration for optimal quantization and storage based on statistical sampling | In Progress | BETA |
Storage and Performance
| Feature | Status | Phase |
|---|---|---|
| Buffered IO, equivalent to memory mapped files for efficient caching | Completed | MVP/ALPHA |
| Custom storage layer with serialization of index and corresponding file formats | Completed | MVP/ALPHA |
| Lazy Loading of index nodes, fulfilling DiskANN requirements for low memory use | Completed | MVP/ALPHA |
| LRU cache for lazy loaded items | Completed | MVP/ALPHA |
| Separation of compute & storage architecture | Completed | MVP/ALPHA |
| Advanced caching strategies | Todo | BETA |
| Distributed storage support | Todo | RC/GA |
| Implement advanced sharding for multi-billion scale datasets | Todo | RC/GA |
| Enhance high availability and redundancy features | Todo | RC/GA |
Data Management and Versioning
| Feature | Status | Phase |
|---|---|---|
| Versioning with transaction-based historical revisions and branching | Completed | MVP/ALPHA |
| Lazy loadable collections (Set, Map, Vec, Array, EagerLazyLoad, etc) | Completed | MVP/ALPHA |
| Auto creation of indexes | Completed | MVP/ALPHA |
| Advanced versioning features, like branching & related APIs | Todo | BETA |
| Improve usability of versioning system | Todo | BETA |
| Multi-modal data support | Todo | RC/GA |
| Add native support for storing documents and multi-modal data types | Todo | RC/GA |
Query and API
| Feature | Status | Phase |
|---|---|---|
| RESTful API (upsert, ANN, collection create, create index) | Completed | MVP/ALPHA |
| Developing user-facing RESTful API for Inverted Index | In Progress | BETA |
| Integrating HNSW hyperparameters API | In Progress | BETA |
| GraphQL API support | Todo | RC/GA |
| Implement metadata filtering | Todo | BETA |
Graph Database and Knowledge Graph
| Feature | Status | Phase |
|---|---|---|
| Cos Query Language (CosQL) specification | Completed | MVP/ALPHA |
| Rule, Fact, Schema parser for data definition, manipulation & querying | Completed | MVP/ALPHA |
| Rule evaluation engine (detailed design document created) | Completed | MVP/ALPHA |
| Enhanced CosQL features | Todo | BETA |
| Enhance graph database rule evaluation engine and improve performance | Todo | BETA |
| Integrate LLM/model for natural language querying of knowledge graphs and relational data | Todo | RC/GA |
| Implement Agentic Memory capabilities | Todo | RC/GA |
Cloud Integration and Web Application
| Feature | Status | Phase |
|---|---|---|
| Prototype web-based management interface | Todo | MVP/ALPHA |
| Begin development of comprehensive web application | Todo | BETA |
| Implement basic serverless functions | Todo | BETA |
| Integrate with major cloud ecosystems (initial) | Todo | BETA |
| Release production-ready web application | Todo | RC/GA |
| Implement advanced serverless functions | Todo | RC/GA |
| Fully integrate with major cloud ecosystems | Todo | RC/GA |
| Develop comprehensive analytics features in web application | Todo | RC/GA |
Integration and Ecosystem
| Feature | Status | Phase |
|---|---|---|
| Integrate with major text and image vectorization models | Todo | RC/GA |
| Integrate with LangChain, LlamaIndex, and similar frameworks | Todo | RC/GA |
| Develop web application and cloud serverless integration with major ecosystems | Todo | RC/GA |
Security and Access Control
| Feature | Status | Phase |
|---|---|---|
| Develop authentication and IAM user roles for filtering/joining HNSW and Inverted indexes | Todo | RC/GA |
Ongoing Improvements
| Feature | Status | Phase |
|---|---|---|
| Ongoing bug fixes and performance improvements | In Progress | ALL PHASES |