olmOCR
OCR toolkit from AI2 that uses vision-language models for document understanding.
Open Source · Self Hosted · Offline Capable · GPU Required (8GB+ VRAM)
About
olmOCR, from AI2 (the Allen Institute for AI), is a toolkit that uses vision-language models for OCR and document understanding. It goes beyond traditional OCR by taking document structure, context, and semantics into account. It is part of the OLMo family and is released under the Apache 2.0 license.
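As a rough illustration of how the toolkit might be driven from a script, the sketch below only assembles a command line for olmOCR's batch pipeline and prints it; nothing is executed. It assumes the `olmocr` package is installed and exposes the `olmocr.pipeline` module with a `--pdfs` option, which can vary between releases. The helper name `build_olmocr_command`, the workspace path, and `sample.pdf` are hypothetical placeholders.

```python
import sys


def build_olmocr_command(workspace: str, pdf_paths: list[str]) -> list[str]:
    """Return an argv list for the olmOCR pipeline (assumed CLI; nothing runs here)."""
    if not pdf_paths:
        raise ValueError("at least one PDF path is required")
    # Assumed invocation shape: python -m olmocr.pipeline <workspace> --pdfs <files...>
    return [sys.executable, "-m", "olmocr.pipeline", workspace, "--pdfs", *pdf_paths]


# Hypothetical workspace directory and input file, for illustration only.
command = build_olmocr_command("./localworkspace", ["sample.pdf"])
print(" ".join(command))
```

Passing the resulting list to `subprocess.run` would launch the job on a machine with a suitable GPU; check the installed version's `--help` output before relying on any flag.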
Details
- Category: OCR & Document Processing
- Price: Free
- Platform: Local/Desktop
- Difficulty: Intermediate (3/5)
- License: Apache-2.0
- Minimum VRAM: 8 GB
- Added: Apr 3, 2026
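The minimum VRAM figure above can be checked before installing anything. The following is a minimal sketch, assuming an NVIDIA GPU with the `nvidia-smi` tool on the PATH; the 8 GB threshold is taken from this listing, and the function names are illustrative. Reported totals can differ slightly from a card's nominal size, so treat the comparison as approximate.

```python
import subprocess

MIN_VRAM_GB = 8  # minimum stated in this listing


def parse_vram_mib(nvidia_smi_output: str) -> list[int]:
    """Parse one integer (MiB) per GPU from nvidia-smi's headerless CSV output."""
    return [int(line.strip()) for line in nvidia_smi_output.splitlines() if line.strip()]


def meets_minimum(vram_mib: list[int], min_gb: float = MIN_VRAM_GB) -> bool:
    """Return True if at least one GPU reports min_gb GiB of memory or more."""
    return any(mib >= min_gb * 1024 for mib in vram_mib)


def query_vram_mib() -> list[int]:
    """Query total memory per GPU; return [] if nvidia-smi is missing or fails."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True,
        )
        return parse_vram_mib(result.stdout)
    except (FileNotFoundError, subprocess.CalledProcessError, ValueError):
        return []


if __name__ == "__main__":
    gpus = query_vram_mib()
    print("GPU memory (MiB):", gpus)
    print("Meets listing minimum:", meets_minimum(gpus))
```

Keeping the parsing separate from the `nvidia-smi` call means the threshold logic can be exercised on machines without a GPU.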
Similar Tools
Most widely used open-source OCR engine, supporting 100+ languages.
Open Source · Self Hosted · Offline
Difficulty: Easy
Ready-to-use OCR library supporting 80+ languages with a simple Python API.
Open Source · Self Hosted · Offline
Difficulty: Beginner
Multilingual OCR toolkit by PaddlePaddle with state-of-the-art accuracy.
Open Source · Self Hosted · Offline
Difficulty: Easy