DocTR
Deep learning based OCR library in Python and TensorFlow/PyTorch.
About
docTR by Mindee is an optical character recognition library for Python built on TensorFlow and PyTorch. It splits OCR into a text detection stage that localizes words and a recognition stage that reads them, shipping pretrained models for both plus end-to-end predictor wrappers. It handles documents and natural images and exports structured results with word positions. Released under the Apache 2.0 license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- OCR & Document Processing
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Easy (2/5)
- License
- Apache-2.0
- Added
- Apr 3, 2026
Related Tools
Python library for extracting text, tables, and metadata from PDFs.
Python bindings for MuPDF library for fast PDF text and image extraction.
Python library for extracting tables from PDF files.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Turn-key OCR system for historical and non-Latin script documents.
Vision-language model based OCR toolkit by AI2 for document understanding.