olmOCR
Vision-language-model-based OCR toolkit from AI2 for document understanding.
About
olmOCR, from the Allen Institute for AI (AI2), is a toolkit that uses vision-language models to convert PDFs and image-based documents into clean plain text. It goes beyond character recognition by accounting for document structure and reading order. It ships with olmOCR-Bench, a benchmark of over 7,000 test cases across 1,400 documents for comparing OCR systems. The toolkit is released under the Apache 2.0 license.
Details
- Category: OCR & Document Processing
- Price: Free
- Platform: Local/Desktop
- Difficulty: Intermediate (3/5)
- License: Apache-2.0
- Minimum VRAM: 8 GB
- Added: Apr 3, 2026
Related Tools
Python bindings for MuPDF library for fast PDF text and image extraction.
Python library for extracting tables from PDF files.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Turn-key OCR system for historical and non-Latin script documents.
One-stop tool for high-quality PDF extraction to Markdown or JSON.
Python library for extracting text, tables, and metadata from PDFs.