Overview
- RAGdb proposes a zero-dependency, single-file RAG container using SQLite and ONNX to replace cloud vector databases and heavy deep learning runtimes for edge and privacy-constrained deployments.
- On a consumer Intel i7-1165G7 laptop, RAGdb reports 100% Recall@1 for entity retrieval, 31.6x faster incremental ingestion than cold starts, and roughly 99.5% smaller disk footprint than Docker-based stacks.
- SmartChunk introduces a planner that selects chunk abstraction per query and a lightweight compression module that produces high-level embeddings without repeated summarization to improve long-document QA.
- The SmartChunk paper reports state-of-the-art accuracy with lower cost across five QA benchmarks plus an out-of-domain set, enabled by a reinforcement-learning scheme called STITCH.
- A comparative study on an e-commerce SKB (STaRK) finds retriever–reranker pipelines over knowledge graphs improve Hit@1 by 20.4% and MRR by 14.5% versus published benchmarks.