RAG & Document Retrieval AI Tools
Open-source retrieval-augmented generation pipelines, document loaders, web scrapers, and local knowledge base tools.
Open-source retrieval-augmented generation pipelines, document loaders, web scrapers, and local knowledge base tools.
Open-source RAG engine with deep document understanding and chunk-level citations.
Production-ready AI project for private document interaction without data leaving your environment.
All-in-one desktop and Docker app for private LLM chat with your documents.
AI-powered web scraping library using LLMs for intelligent data extraction.
Chat with documents locally using any open-source LLM without data leaving your device.
Open-source RAG chatbot by Weaviate for exploring datasets and documents.
Managed vector database for machine learning applications
Open-source data extraction and indexing engine for RAG applications.
Local RAG capabilities in GPT4All for chatting with documents privately.
Lightweight library for using reranking models to improve search results.
Python toolkit for reproducing and developing RAG research.
Library for building stateful multi-agent applications with LLMs.
Visual pipeline builder for Haystack RAG and search applications.
AI-native vector database with built-in ML modules
Self-hosted application layer for LLMs with chat, RAG, web search, code execution, and agents.
Python library that simplifies training and using ColBERT for late-interaction retrieval in RAG.
Open-source AI second brain for chatting with documents and knowledge bases.
Modular open-source RAG framework for building production document retrieval applications.
Cloud-native vector database for scalable similarity search
Web scraping and browser automation library by Apify for Node.js and Python.
Web scraping API that turns websites into clean LLM-ready markdown.