Marker

Converts PDFs to clean Markdown with high accuracy for text, tables, and equations.

Open Source, Self Hosted, Offline Capable, GPU Required (4GB+ VRAM)

About

Marker, by Vik Paruchuri, converts PDFs and other documents into clean Markdown, JSON, chunks, or HTML quickly and accurately. It handles text, tables, LaTeX equations, and images through a pipeline of layout detection, OCR, and formatting models. A managed platform and a batch service are also offered. The open-source tool is released under the GPL-3.0 license; a separate commercial license is available for self-hosting.
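As a rough illustration of choosing one of the output formats listed above, the sketch below builds a command line for a single-file conversion. It assumes the tool is installed (for example via the `marker-pdf` package) and exposes a `marker_single` command with `--output_format` and `--output_dir` options; check `marker_single --help` for your installed version. The file name `paper.pdf`, the directory `out`, and the helper `build_marker_command` are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path

# Output formats named in the description above.
OUTPUT_FORMATS = ("markdown", "json", "chunks", "html")


def build_marker_command(pdf_path, output_dir, output_format="markdown"):
    """Return the argument list for a single-file conversion.

    Assumption: the `marker_single` CLI and its `--output_format` and
    `--output_dir` options; confirm with `marker_single --help`.
    """
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {output_format!r}")
    return [
        "marker_single",
        str(Path(pdf_path)),
        "--output_format", output_format,
        "--output_dir", str(Path(output_dir)),
    ]


if __name__ == "__main__":
    # Hypothetical input file and output directory.
    cmd = build_marker_command("paper.pdf", "out", output_format="json")
    if shutil.which(cmd[0]) is None:
        # The tool is not on PATH: show the command instead of running it.
        print("marker_single not found; would run:", " ".join(cmd))
    else:
        subprocess.run(cmd, check=True)
```

Keeping the command construction in its own function makes it easy to validate the format before any model is loaded, which matters on machines close to the 4 GB VRAM minimum.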

Details

Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: GPL-3.0
Minimum VRAM: 4 GB
Added: Apr 3, 2026

Related Tools

Python library for extracting text, tables, and metadata from PDFs.

Open Source, Self Hosted, Offline
Beginner

Python bindings for MuPDF library for fast PDF text and image extraction.

Open Source, Self Hosted, Offline
Beginner

Python library for extracting tables from PDF files.

Open Source, Self Hosted, Offline
Beginner

Ready-to-use OCR library supporting 80+ languages with simple Python API.

Open Source, Self Hosted, Offline
Beginner

Turn-key OCR system for historical and non-Latin script documents.

Open Source, Self Hosted, Offline
Intermediate

Vision-language model based OCR toolkit by AI2 for document understanding.

Open Source, Self Hosted, Offline, GPU 8GB+
Intermediate