Camelot

Python library for extracting tables from PDF files.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Camelot is a Python library for extracting tables from text-based PDF files into pandas DataFrames without OCR. It offers two parsing flavors, lattice for ruled tables and stream for whitespace-separated ones, and exposes accuracy metrics so results can be tuned for complex layouts. An interactive quickstart notebook is provided. Released under the MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Beginner (1/5)
License
MIT
Added
Apr 3, 2026

Related Tools

Python library for extracting text, tables, and metadata from PDFs.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Python bindings for MuPDF library for fast PDF text and image extraction.

Open SourceSelf HostedOffline
Beginner
0.0 (0)
Featured

Ready-to-use OCR library supporting 80+ languages with simple Python API.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Turn-key OCR system for historical and non-Latin script documents.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

One-stop tool for high-quality PDF extraction to Markdown or JSON.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Vision-language model based OCR toolkit by AI2 for document understanding.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Browse all OCR & Document Processing tools