OCR & Document Processing AI Tools
Open-source optical character recognition engines and document parsing tools for extracting text and structure from images and PDFs.
Open-source optical character recognition engines and document parsing tools for extracting text and structure from images and PDFs.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Most widely-used open-source OCR engine supporting 100+ languages.
Document parsing library by IBM for converting PDFs and documents to structured data.
Multilingual OCR toolkit by PaddlePaddle with state-of-the-art accuracy.
Turn-key OCR system for historical and non-Latin script documents.
One-stop tool for high-quality PDF extraction to Markdown or JSON.
Multilingual document OCR toolkit with line detection and layout analysis.
Adds searchable text layer to scanned PDFs using Tesseract OCR.
Converts PDFs to clean Markdown with high accuracy for text, tables, and equations.
Neural OCR model by Meta for academic documents and mathematical expressions.
Open-source library for preprocessing unstructured documents for LLM pipelines.
Tool for extracting tables from PDF files into CSV or DataFrame format.
General OCR Theory model with unified end-to-end architecture for various OCR tasks.
Simple Python library to convert PDF pages to images using Poppler.
End-to-end OCR model using vision-language architecture.
OCR tool specialized in recognizing mathematical equations and LaTeX.
Vision-language model based OCR toolkit by AI2 for document understanding.
Deep learning based OCR library in Python and TensorFlow/PyTorch.
Python library for extracting text, tables, and metadata from PDFs.
Python bindings for MuPDF library for fast PDF text and image extraction.
Python library for extracting tables from PDF files.