CapyDB is a high-level database built specifically for Large Language Model (LLM) applications. It unifies multiple database architectures—NoSQL, vector, and object storage—within a single platform, allowing seamless storage, indexing, and retrieval of structured, unstructured, and vector-based data. This makes CapyDB the ideal choice for AI-driven projects, particularly those focused on natural language processing and data analysis.
Much like how high-level programming languages like Python abstract away technical complexities to simplify development, CapyDB abstracts the complexities of different database architectures. By integrating NoSQL, vector, and object storage under one system, it provides developers with an accessible, powerful platform to manage the diverse data needs of LLM applications—without requiring expertise in multiple types of databases.
CapyDB offers several key advantages for developers:
CapyDB includes a Mongo-compatible NoSQL database for flexible document-based storage and querying, making it easy for developers familiar with MongoDB tools.
CapyDB integrates a high-performance vector database that supports:
CapyDB's object storage manages unstructured data like files and images, complementing its structured and vector data capabilities.
CapyDB extends the standard BSON (Binary JSON) format with EmbJSON (CapyDB Extended JSON), which simplifies managing and querying complex data structures like text embeddings. EmbJSON is key to CapyDB's database abstraction and is explained further in EmbJSON Overview.