Adobe PDF Extract API Tutorial
Adobe PDF Extract API Tutorial
Pre-Setup
1. MAKING SURE PYTHON IS IN A PROPER PATH
Enter run into search tab on taskbar on windows
Copy and paste sysdm.cpl into search bar
Go to environmental variables, ensure you input the path to your python scripts.
Clicking the link at the beginning and logging into your Adobe account, you should be presented
with the following (above).
- Input a name
- Click Python extract
- Agree to terms
Notice how I have placed the code OUTSIDE of the adobe folder, for easy access.
You will also need to download the requirements.txt file from my folder, or from the GitHub
4. Opening the script in Visual Studio Code
In MS Visual Studio Code (once installed) open the pdf_extract.py file
This is where you will put your own information/path regarding where you are keeping the
Adobe folder.
In the Adobe folder, you place the PDFs you want to work on in the resources folder. You will get your
data out in the output folder.
List the names, in quotes and with commas, the files you want to use (name them something easy) in
listfiles. The list output name is the folder name with your extracted data.
Change directories in Visual Studio code to that of the folder containing the script
Use the terminal and type:
cd .. to move backwards
and cd Path.name.here to move forwards
Use the Run button on top of Visual Studio code, do without debugging.
Go to outputs in your Adobe folder and get your data. Sigh in relief.