Read PDF and Extract Data in Tabular Format
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi,
I have a use case where I get a PDF file from client and extract the data from it and populate details in an excel sheet with prefixed headers. I have to automate this process end to end attached is the dummy PDF file for your reference.
Here are few things I want to acheive:
1. Extract the data from PDF to Excel
2. I read somewhere that it requires OCR tool integration which is not supported in my org. so is there a way to convert the PDF to text or docx or any other readable format from where I have pull the details in excel template.
Any help on this is massively appreciated.
Thanks,
Swati
Solved! Go to Solution.
- Labels:
- Datasets
- Developer Tools
- Input
- Workflow
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @Sasthana25 ,
you will need either the Intelligence Suite module for Alteryx, or the pdf macro I've used in the attached example.
I obviously don't know what format you want the data but I've attached a workflow you can mess around with.
This yields the following results:
I hope this helps,
M.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Sorry - i guess the link isnt working :
The page you were looking for doesn't exist.
You may have mistyped the address or the page may have moved.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Unfortunately, I am not on the latest version of Alteryx:
Failure to Import
This workflow was created by a more recent version of Alteryx, and may contain tools or functionality not present in this version. Alteryx does not support using an earlier version of Alteryx to open a workflow created with a newer version. For best results, download the latest version of Alteryx.
Any suggestion how can I view your workflow and apply the logic here - Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
