I have a PDF document containing tabular or structured data with specific items and values that I need to extract efficiently into a usable data excel format. I want to automate this extraction in Alteryx, but I am unsure about the best approach or tools within Alteryx to parse the PDF and accurately capture the items and their corresponding values. Could someone provide guidance or a workflow example to help with PDF data extraction, especially for structured data fields
Hi, computer vision tools should work well for your needs, although they require the intelligence suite licence. It can help you transform the PDF context (like strings or tables) to a tabular structure. You can design the template or use auto detection. The attached workflow provides guidance on using the tool. Hope this is helpful