RAG-Forge
Framework-agnostic CLI toolkit for production-grade RAG pipelines with evaluation baked in.
RAG-Forge gives you the building blocks for retrieval-augmented generation — ingestion, chunking, hybrid retrieval, evaluation, drift detection, cost tracking, and observability — wired together with a single CLI. No framework lock-in. Bring your own LLM provider, your own vector store, your own embeddings.
Start here
- Installation — Get the CLI running in under a minute.
- Quick Start — Index a corpus and run your first query.
- Core concepts — The mental model before you build.
Reference
- CLI Reference — Every
rag-forgecommand, every flag. - Templates — Scaffolds for common RAG patterns.
- Concepts — How chunking, retrieval, and evaluation work under the hood.
- MCP Server — Expose RAG-Forge to Claude Desktop and other MCP clients.
Why RAG-Forge
RAG is hard to get right. Most teams ship a prototype, watch it break on real queries, and have no framework for diagnosing why. RAG-Forge is built around the RAG Maturity Model (RMM) — a 0-to-5 scale that turns vague “our RAG is broken” conversations into specific, measurable improvements. Every command is designed to move you up one level.