PDF recognition often misses several pages.

#33

by xldistance - opened Oct 23, 2025

Discussion

xldistance

Oct 23, 2025


edited Oct 23, 2025

nanonets_ocr doesn't have this problem. The PDF pages are converted to images like this:

            from pdf2image import convert_from_path

            # Convert every page of the PDF to a PIL image.
            images = list(convert_from_path(
                pdf_path,
                poppler_path=r"H:\Langchain-Chatchat0.3\poppler-24.08.0\Library\bin",
            ))

The images have already been converted to RGB mode.
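To narrow down where the pages go missing, it may help to first confirm that the conversion step itself returns every page. Below is a minimal sketch, assuming pdf2image is installed and using its `pdfinfo_from_path` and `convert_from_path` functions. The helper names `missing_pages` and `check_conversion` are hypothetical and only for illustration.

```python
from typing import Iterable, List, Optional


def missing_pages(total_pages: int, processed: Iterable[int]) -> List[int]:
    """Return the 1-based page numbers in 1..total_pages absent from `processed`."""
    seen = set(processed)
    return [n for n in range(1, total_pages + 1) if n not in seen]


def check_conversion(pdf_path: str, poppler_path: Optional[str] = None) -> List[int]:
    """Hypothetical helper: list the pages that pdf2image fails to convert."""
    # Imported lazily so missing_pages() works without pdf2image installed.
    from pdf2image import convert_from_path, pdfinfo_from_path

    # Page count as reported by poppler's pdfinfo.
    total = pdfinfo_from_path(pdf_path, poppler_path=poppler_path)["Pages"]

    converted = []
    for n in range(1, total + 1):
        # Convert one page at a time so a missing page can be pinned down.
        pages = convert_from_path(
            pdf_path,
            first_page=n,
            last_page=n,
            poppler_path=poppler_path,
        )
        if pages:
            converted.append(n)
    return missing_pages(total, converted)
```

If `check_conversion(pdf_path, poppler_path)` returns an empty list, every page reached the image stage and the loss most likely happens later, during recognition. The same `missing_pages` helper can then be applied to the page numbers for which the model returned text.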
