Tools/OCR & Document Processing/pdf2image

pdf2image

Simple Python library to convert PDF pages to images using Poppler.

Open SourceSelf HostedOffline Capable

0.0 (0)

Visit Website View on GitHub

About

pdf2image is a small Python module that converts PDF pages into PIL Image objects by wrapping the Poppler utilities pdftoppm and pdftocairo. It is commonly used as a preprocessing step before OCR or image analysis and has minimal Python dependencies, though it requires Poppler installed on the system. It supports page ranges, DPI control, and several output formats. Released under the MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: OCR & Document Processing
Price: Free
Platform: Local/Desktop
Difficulty: Beginner (1/5)
License: MIT
Added: Apr 3, 2026

Tags

pdf image conversion poppler preprocessing simple

Related Tools

Featured

Docling

OCR & Document Processing

Document parsing library by IBM for converting PDFs and documents to structured data.

Open SourceSelf HostedOffline

Easy

0.0 (0)

DocTR

OCR & Document Processing

Deep learning based OCR library in Python and TensorFlow/PyTorch.

Open SourceSelf HostedOffline

Easy

0.0 (0)

MinerU

OCR & Document Processing

One-stop tool for high-quality PDF extraction to Markdown or JSON.

Open SourceSelf HostedOffline

Easy

0.0 (0)

PyMuPDF

OCR & Document Processing

Python bindings for MuPDF library for fast PDF text and image extraction.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Tabula

OCR & Document Processing

Tool for extracting tables from PDF files into CSV or DataFrame format.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Camelot

OCR & Document Processing

Python library for extracting tables from PDF files.

Open SourceSelf HostedOffline

Beginner

0.0 (0)

Browse all OCR & Document Processing tools