olmOCR

Vision-language model based OCR toolkit by AI2 for document understanding.

Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)
0.0 (0)

About

olmOCR by the Allen Institute for AI is a toolkit that uses vision-language models to convert PDFs and image-based documents into clean plain text, going beyond character recognition to account for document structure and reading order. It ships olmOCR-Bench, a benchmark of over 7,000 test cases across 1,400 documents for comparing OCR systems. Released under the Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Minimum VRAM
8 GB
Added
Apr 3, 2026

Related Tools

Python bindings for MuPDF library for fast PDF text and image extraction.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Python library for extracting tables from PDF files.

Open SourceSelf HostedOffline
Beginner
0.0 (0)
Featured

Ready-to-use OCR library supporting 80+ languages with simple Python API.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Turn-key OCR system for historical and non-Latin script documents.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

One-stop tool for high-quality PDF extraction to Markdown or JSON.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Python library for extracting text, tables, and metadata from PDFs.

Open SourceSelf HostedOffline
Beginner
0.0 (0)
Browse all OCR & Document Processing tools