Note The agentic-doc Python library is now legacy. Please migrate to the new landingai-ade library, which is now the official Python library for Agentic Document Extraction and supports our newer API ...
Abstract: There is a sudden increase in digital data as well as a rising demand for extracting text efficiently from images. These two led to full optical character recognition systems are introduced ...
Converts every page of a given PDF file into an image-based page and applies random askew and brightness (very mild) effects to simulate the appearance of scanned documents. The resulting pages are ...