OCRFlux-3B
End-to-end OCR model using vision-language architecture.
Open SourceSelf HostedOffline CapableGPU Required (6GB+ VRAM)
0.0 (0)
About
OCRFlux-3B is an end-to-end OCR model using a vision-language architecture for document understanding. Handles diverse document types including forms, receipts, and tables. 3B parameter model.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- OCR & Document Processing
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Intermediate (3/5)
- Minimum VRAM
- 6 GB
- Added
- Apr 3, 2026
Related Tools
Python library for extracting text, tables, and metadata from PDFs.
Open SourceSelf HostedOffline
Beginner
0.0 (0)
Python bindings for MuPDF library for fast PDF text and image extraction.
Open SourceSelf HostedOffline
Beginner
0.0 (0)
Python library for extracting tables from PDF files.
Open SourceSelf HostedOffline
Beginner
0.0 (0)
Featured
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Open SourceSelf HostedOffline
Beginner
0.0 (0)
Turn-key OCR system for historical and non-Latin script documents.
Open SourceSelf HostedOffline
Intermediate
0.0 (0)
Vision-language model based OCR toolkit by AI2 for document understanding.
Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)