Particle.news

New ArXiv RAG Papers Tackle Lean Stacks, Adaptive Retrieval, and Knowledge-Graph QA

All results are author-reported preprints awaiting replication.

Overview

  • RAGdb proposes a zero-dependency, single-file RAG container using SQLite and ONNX to replace cloud vector databases and heavy deep learning runtimes for edge and privacy-constrained deployments.
  • On a consumer Intel i7-1165G7 laptop, RAGdb reports 100% Recall@1 for entity retrieval, 31.6x faster incremental ingestion than cold starts, and roughly 99.5% smaller disk footprint than Docker-based stacks.
  • SmartChunk introduces a planner that selects chunk abstraction per query and a lightweight compression module that produces high-level embeddings without repeated summarization to improve long-document QA.
  • The SmartChunk paper reports state-of-the-art accuracy with lower cost across five QA benchmarks plus an out-of-domain set, enabled by a reinforcement-learning scheme called STITCH.
  • A comparative study on an e-commerce SKB (STaRK) finds retriever–reranker pipelines over knowledge graphs improve Hit@1 by 20.4% and MRR by 14.5% versus published benchmarks.