
DocTR

Deep-learning OCR library for Python with TensorFlow and PyTorch backends.

Open Source, Self Hosted, Offline Capable


About

docTR by Mindee is an optical character recognition library for Python built on TensorFlow and PyTorch. It splits OCR into a text detection stage that localizes words and a recognition stage that reads them, and it ships pretrained models for both stages plus end-to-end predictor wrappers. It handles documents and natural images and exports structured results with word positions. It is released under the Apache 2.0 license.
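The two-stage flow and the structured export described above can be sketched as follows. This is a minimal illustration, not an official recipe: `run_doctr` and `iter_words` are hypothetical helper names, and the layout of the exported dictionary (pages, blocks, lines, words, with word geometry as relative corner coordinates and page dimensions as height then width) is an assumption to check against the docTR version you have installed.

```python
from typing import Any, Dict, Iterator, Tuple

Box = Tuple[int, int, int, int]


def run_doctr(image_path: str) -> Dict[str, Any]:
    """Run docTR's end-to-end predictor on one image and export a nested dict."""
    # Imported lazily so iter_words() below works without docTR installed.
    from doctr.io import DocumentFile
    from doctr.models import ocr_predictor

    # Wraps a pretrained detection model and a pretrained recognition model.
    model = ocr_predictor(pretrained=True)
    doc = DocumentFile.from_images(image_path)
    return model(doc).export()


def iter_words(export: Dict[str, Any]) -> Iterator[Tuple[str, Box]]:
    """Yield (text, (left, top, right, bottom)) in pixels for every word."""
    for page in export["pages"]:
        # Assumption: dimensions are stored as (height, width).
        height, width = page["dimensions"]
        for block in page["blocks"]:
            for line in block["lines"]:
                for word in line["words"]:
                    # Assumption: geometry is ((xmin, ymin), (xmax, ymax)) in [0, 1].
                    (x0, y0), (x1, y1) = word["geometry"]
                    box = (
                        round(x0 * width),
                        round(y0 * height),
                        round(x1 * width),
                        round(y1 * height),
                    )
                    yield word["value"], box
```

Calling `list(iter_words(run_doctr("scan.png")))` (with `scan.png` standing in for any local image) would give a flat list of recognized words with pixel boxes. Keeping the traversal separate from the model call means it can be tested on a hand-written dictionary without downloading model weights.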


Details

Category: OCR & Document Processing
Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: Apache-2.0
Added: Apr 3, 2026

Tags

ocr, deep-learning, text-detection, recognition, mindee

Related Tools


Docling

OCR & Document Processing

Document parsing library by IBM for converting PDFs and documents to structured data.

Open Source, Self Hosted, Offline

Easy


MinerU

OCR & Document Processing

One-stop tool for high-quality PDF extraction to Markdown or JSON.

Open Source, Self Hosted, Offline

Easy


PyMuPDF

OCR & Document Processing

Python bindings for MuPDF library for fast PDF text and image extraction.

Open Source, Self Hosted, Offline

Beginner


Tabula

OCR & Document Processing

Tool for extracting tables from PDF files into CSV or DataFrame format.

Open Source, Self Hosted, Offline

Beginner



EasyOCR

OCR & Document Processing

Ready-to-use OCR library supporting 80+ languages with simple Python API.

Open Source, Self Hosted, Offline

Beginner


Camelot

OCR & Document Processing

Python library for extracting tables from PDF files.

Open Source, Self Hosted, Offline

Beginner

