olmOCR

OCR toolkit from AI2, built on a vision-language model, for document understanding.

Open Source, Self Hosted, Offline Capable, GPU Required (8 GB+ VRAM)

About

olmOCR, from AI2 (the Allen Institute for AI), uses vision-language models for document understanding and OCR. It goes beyond traditional OCR by taking document structure, context, and semantics into account. It is part of the OLMo family and is released under the Apache 2.0 license.

Details

Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: Apache-2.0
Minimum VRAM: 8 GB
Added: Apr 3, 2026

Similar Tools

Most widely used open-source OCR engine, supporting 100+ languages.

Open Source, Self Hosted, Offline
Difficulty: Easy

Ready-to-use OCR library supporting 80+ languages, with a simple Python API.

Open Source, Self Hosted, Offline
Difficulty: Beginner

Multilingual OCR toolkit by PaddlePaddle with state-of-the-art accuracy.

Open Source, Self Hosted, Offline
Difficulty: Easy