Welcome to RAGHub, a living collection of new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. This is a community-driven project for r/RAG, where we aim to catalog the rapid growth of RAG tools and projects that are pushing the boundaries of the field.
Each day, it feels like a new tool or framework emerges, and choosing the right one is becoming more of an art than a science. Is the framework from three months ago still relevant? Or was it just hype, rehashing old concepts with a fresh look? RAGHub exists to help you stay ahead of these changes, providing a platform for the latest innovations in RAG.
This is a community project, and we welcome contributions from everyone! If you’d like to add a new framework, project, or resource, please check out our Contribution Guidelines for details on how to get started.
Name | Description | Website | Github | Stars | Activity |
---|---|---|---|---|---|
LangChain | A framework for building applications with LLMs | Website | Github | 93.2k | 9h ago |
Haystack | A framework for building search engines using neural networks | Website | Github | 17k | Last week |
LlamaIndex | A framework for building data-driven LLM applications | Website | Github | 35.9k | 7h ago |
BentoML | Build Inference APIs, LLM apps, Multi-model chains, RAG service | Website | Github | 7k | 1h ago |
Langroid | Python framework to easily build LLM-powered applications | Website | Github | 2.4k | 10h ago |
NeMo-Guardrails | Toolkit for adding programmable guardrails to LLM-based applications | Website | Github | 4k | Last week |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
Trulens | Measures and enhancse LLM app quality with feedback functions for scalable evaluation | Website | Github | 2.1k | 11h ago |
Phoenix | AI observability platform designed for experimentation, evaluation, and troubleshooting | Website | Github | 3.6k | 1d ago |
ragas | Evaluates and quantifies the performance of RAG pipelines that enhance LLM context with external data | Website | Github | 6.8k | 3h ago |
Deepchecks | Continuous validation of AI & ML models, detecting data drift and model issues | Website | Github | 3.6k | 8m ago |
AutoRAG | End-to-end RAG optimization: parsing, chunking, evaluation dataset creation, and pipeline deployment | Website | Github | 1.6k | 1h ago |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
R2R | The Elasticsearch for RAG, helps you quickly build and launch scalable RAG solutions | Website | Github | 3.4k | 6h ago |
RAGFlow | Open-source RAG engine based on deep document understanding | Website | Github | 18.7k | 1h ago |
Embedchain | Open Source Framework for personalizing LLM responses under 10 lines of code | Website | Github | 22.2k | Last week |
txtai | All-in-one embeddings database for semantic search, LLM orchestration, and RAG workflows | Website | Github | 8.8k | Last week |
dsRAG | High-performance retrieval engine for unstructured data | - | Github | 815 | Last week |
Flash-Rank | Use Pairwise or Listwise rerankers to improve search accuracy before passing to LLMs. | Github | 606 | 2w ago | |
Graphlit | API-first platform for building knowledge-driven AI applications and agents | Website | Github | 16 | 8h ago |
rag-citation | Combines RAG with automatic citation generation to enhance content credibility | Website | Github | 6 | Last week |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
LlamaParse | GenAI-native document parsing platform | Website | Github | 2.8k | 2d ago |
Langchain-extract | Web server to extract information from text and files using LLMs | Website | Github | 1k | 4m ago |
Unstructured.io | build custom preprocessing pipelines for labeling, training, or production ml | Website | Github | 8.7k | 3d ago |
Verba | RAG chatbot powered by Weaviate | Website | Github | 6.1k | 2w ago |
Unstract | No-code Platform to launch APIs and ETL Pipelines to structure unstructured documents | Website | Github | 2.3k | 4h ago |
Humata.ai | Ask questions across all of your document files | Website | 4h ago | ||
Ragie.ai | Fully managed RAG-as-a-Service for developers. | Website | Github | 12 | 12h ago |
Reducto | Parses complex documents and creates LLM-ready inputs | Website | Github | 16 | 2w ago |
Midship | Extract document data straight into your spreadsheet/erp/crm | Website | Github | - | - |
DocuPanda | Convert documents into a structured, standard set of fields and values | Website | - | - | - |
contextual-doc-retrieval-opneai-reranker | Using GPT-4 and Cohere for query expansion and re-ranking with BM25 | GitHub | 20 | Last week |
Site/Article | Description | Link |
---|---|---|
Contextual Retrieval | Anthropic introducing Contextual Retrieval | Website |
Open-RAG | Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Website |
ColPali | Efficient Document Retrieval with Vision Language Models | Website |
RAG_Techniques | Showcases various advanced techniques for RAG systems | Website |
GenAI_Agents | Tutorials and implementations for various AI Agent techniques | Website |
If you're looking for mainstream RAG frameworks and techniques**, check out the excellent repository by Nir Diamant: RAG Techniques. This repository focuses on more established tools and methods that have already gained traction in the community.
This project is licensed under the MIT License. See the LICENSE file for details.
This project is part of the r/RAG community. Have feedback or suggestions? Feel free to open an issue, start a discussion, or join the conversation on our Discord server! We want to make this repository a valuable resource for everyone exploring the RAG ecosystem, and your input is crucial.