Camelot
Python library for extracting tables from PDF files.
About
Camelot is a Python library for extracting tables from text-based PDF files into pandas DataFrames without OCR. It offers two parsing flavors, lattice for ruled tables and stream for whitespace-separated ones, and exposes accuracy metrics so results can be tuned for complex layouts. An interactive quickstart notebook is provided. Released under the MIT license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- OCR & Document Processing
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Beginner (1/5)
- License
- MIT
- Added
- Apr 3, 2026
Related Tools
Python library for extracting text, tables, and metadata from PDFs.
Python bindings for MuPDF library for fast PDF text and image extraction.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Turn-key OCR system for historical and non-Latin script documents.
One-stop tool for high-quality PDF extraction to Markdown or JSON.
Vision-language model based OCR toolkit by AI2 for document understanding.