OCR & Document Processing AI Tools
Open-source optical character recognition engines and document parsing tools for extracting text and structure from images and PDFs.
Open-source optical character recognition engines and document parsing tools for extracting text and structure from images and PDFs.
Document parsing library by IBM for converting PDFs and documents to structured data.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Multilingual OCR toolkit by PaddlePaddle with state-of-the-art accuracy.
Most widely-used open-source OCR engine supporting 100+ languages.
Vision-language model based OCR toolkit by AI2 for document understanding.
Converts PDFs to clean Markdown with high accuracy for text, tables, and equations.
Neural OCR model by Meta for academic documents and mathematical expressions.
Open-source library for preprocessing unstructured documents for LLM pipelines.
Python library for extracting tables from PDF files.
Tool for extracting tables from PDF files into CSV or DataFrame format.
Python library for extracting text, tables, and metadata from PDFs.
General OCR Theory model with unified end-to-end architecture for various OCR tasks.
Turn-key OCR system for historical and non-Latin script documents.
Python bindings for MuPDF library for fast PDF text and image extraction.
Simple Python library to convert PDF pages to images using Poppler.
End-to-end OCR model using vision-language architecture.
OCR tool specialized in recognizing mathematical equations and LaTeX.
Deep learning based OCR library in Python and TensorFlow/PyTorch.
Multilingual document OCR toolkit with line detection and layout analysis.
Adds searchable text layer to scanned PDFs using Tesseract OCR.
One-stop tool for high-quality PDF extraction to Markdown or JSON.