Zappy

PDF Data Extraction using python

Zappy.ZappyActions.PDF

Extract the PDF file content into an excel file.

Properties

INPUT

File Type – PDF File Type like lattice or stream.

Stream – Stream can be used to parse tables that have white spaces between cells to simulate a table structure.
Lattice – Lattice is more deterministic in nature, and it does not rely on guesses. It can be used to parse tables that have demarcated lines between cells, and it can automatically parse multiple tables present on a page.

INPUT

InputFileName – PDF file path.

Pages – Page Number with comma separated like (1,2, 3, …).

Passwd – Pdf File Password if file is password Protected.

PythonExePath – Python exe path for running the script.

PythonScriptPath – Path of the PythonScript where the python script is saved.

OUTPUT

OutputFilePath – Created excel (.xlsx) file path.

PDF Data Extraction using python


Zappy.ZappyActions.PDF

Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.