Tools/OCR & Document Processing/Docling

Featured Tool

Docling

Document parsing library by IBM for converting PDFs and documents to structured data.

Open SourceSelf HostedOffline Capable

0.0 (0)

Visit Website View on GitHub

About

Docling, from IBM Research Zurich and now hosted by the LF AI and Data Foundation, converts documents into structured representations ready for LLM and RAG pipelines. It parses PDF, DOCX, PPTX, XLSX, HTML, EPUB, images, email formats, and even audio and video files, then exports Markdown, HTML, JSON, or its DocTags format while preserving reading order. Its models perform layout analysis, table structure recognition, and formula and code extraction, with OCR for scanned pages and speech recognition for audio inputs; the compact GraniteDocling visual language model at 258M parameters handles end-to-end conversion. Everything runs locally, which matters for sensitive documents, and an API server mode is available. Native integrations cover LangChain, LlamaIndex, Haystack, and CrewAI, plus an MCP server for agent workflows. Requiring Python 3.10 or later and MIT licensed, it is used by developers building document understanding and retrieval pipelines over messy real-world files.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: OCR & Document Processing
Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: MIT
Added: Apr 3, 2026

Tags

document pdf parsing ibm structured rag markdown

Related Tools

DocTR

OCR & Document Processing

Deep learning based OCR library in Python and TensorFlow/PyTorch.

Open SourceSelf HostedOffline

Easy

0.0 (0)

MinerU

OCR & Document Processing

One-stop tool for high-quality PDF extraction to Markdown or JSON.

Open SourceSelf HostedOffline

Easy

0.0 (0)

PyMuPDF

OCR & Document Processing

Python bindings for MuPDF library for fast PDF text and image extraction.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Tabula

OCR & Document Processing

Tool for extracting tables from PDF files into CSV or DataFrame format.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Featured

EasyOCR

OCR & Document Processing

Ready-to-use OCR library supporting 80+ languages with simple Python API.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Camelot

OCR & Document Processing

Python library for extracting tables from PDF files.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Browse all OCR & Document Processing tools

Mentioned in

PDF Parsing for RAG in 2026: MinerU, Docling, Marker Compared

A benchmarked comparison of MinerU, Docling, Marker 2, Surya, PDF-Extract-Kit and Zerox for RAG ingestion,...

Billy C