Hello,
within the Alteryx Knowledge section I read about: ' Can Alteryx Parse A Word Doc Or PDF?' which was very helpful to get started. Now I am facing some more challenging topics:
I have hundreds of contractual relevant documents like Acceptance Sheets and Change Requests where I need to pick relevant commercial data on a regular basis. These documents are stored on our SharePoint in PDF format. Unfortunately the PDF files are protected. So what is working on a manual basis in my test environment:
- print the protected PDF with a PDF printer into a non protected version (in some cases they need to be unlocked with another tool beforehand)
- save the non protected PDF into plain text format
- run the Alteryx workflow to collect the relevant data
My qustions are:
- Does anyone see any chances to automate the entire workflow with alteryx?
- The DOCTOTEXT tool mentioned in the Knowlege section is not working for me from the start. Are there any tools known that I could use for command line usage to automate the entire workflow?
- Any other alteratives to solve the situation? Manual transformation is not an option for us as this is an onging requirement
Any input is highly appreciated.