pdf2image
Simple Python library to convert PDF pages to images using Poppler.
About
pdf2image is a small Python module that converts PDF pages into PIL Image objects by wrapping the Poppler utilities pdftoppm and pdftocairo. It is commonly used as a preprocessing step before OCR or image analysis. Its Python dependencies are minimal, but Poppler must be installed on the system. It supports page ranges, DPI control, and several output formats, and is released under the MIT license.
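As a rough illustration of the page-range, DPI, and output-format options mentioned above, the following minimal sketch wraps pdf2image's convert_from_path function. The helper names build_convert_kwargs and pdf_to_images, the page_NNN file naming, and the default values are assumptions made for this example and are not part of the library. It also assumes pdf2image and Poppler are installed.

```python
from pathlib import Path


def build_convert_kwargs(dpi=200, first_page=None, last_page=None, fmt="png"):
    """Validate options and return keyword arguments for convert_from_path.

    Hypothetical helper for this example; the defaults are assumptions.
    """
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    if first_page is not None and first_page < 1:
        raise ValueError("pages are numbered from 1")
    if first_page is not None and last_page is not None and last_page < first_page:
        raise ValueError("last_page must not precede first_page")
    kwargs = {"dpi": dpi, "fmt": fmt}
    if first_page is not None:
        kwargs["first_page"] = first_page
    if last_page is not None:
        kwargs["last_page"] = last_page
    return kwargs


def pdf_to_images(pdf_path, out_dir, **options):
    """Convert a PDF to one image file per page and return the written paths."""
    # Imported here so the validation helper works without pdf2image installed.
    from pdf2image import convert_from_path

    kwargs = build_convert_kwargs(**options)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # convert_from_path returns a list of PIL Image objects, one per page.
    pages = convert_from_path(str(pdf_path), **kwargs)

    # Number output files by their page number in the source PDF.
    start = kwargs.get("first_page", 1)
    written = []
    for offset, page in enumerate(pages):
        target = out_dir / f"page_{start + offset:03d}.{kwargs['fmt']}"
        page.save(target)  # Pillow infers the file format from the extension
        written.append(target)
    return written
```

For instance, pdf_to_images("scan.pdf", "out", dpi=300, first_page=1, last_page=3) would render the first three pages of a hypothetical scan.pdf at 300 DPI into the out directory, ready to hand to an OCR step.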
Details
- Category: OCR & Document Processing
- Price: Free
- Platform: Local/Desktop
- Difficulty: Beginner (1/5)
- License: MIT
- Added: Apr 3, 2026
Related Tools
Python library for extracting text, tables, and metadata from PDFs.
Python bindings for MuPDF library for fast PDF text and image extraction.
Python library for extracting tables from PDF files.
Ready-to-use OCR library supporting 80+ languages with simple Python API.
Turn-key OCR system for historical and non-Latin script documents.
Vision-language model based OCR toolkit by AI2 for document understanding.