Hello,
I have extracted a pdf file which contains multiple tables.
Each table have the same column headers. I want to extract these tables in separate excel sheets.
My current output has all the rows together in one column.
A sample of my extraction looks like below image:
So, I need to extract every time the header is read and everything below it until next header comes. Then the next header in another sheet and the next one in another.
Is there a way to achieve this? I have tried using multi row formula but it does not extract rows below the header until the next header is read.
I would greatly appreciate the help here.
Thank you,
Pankhudri
Hey @Pankhudri20
You can modify this as needed, but this should accomplish the task.
I'm using some RecordID tools to split out the headers to assign my desired sheet name, then Union to bring it back together, a Multi-Row Formula tool to name the rest of the rows, Dynamic Rename and Filter to clean it up for output, and then I've configured my Output tool to write to a single Excel file while splitting the data to separate sheets based on a specific field.
Data before output:
Output configuration:
If this solves your issue please mark the answer as correct, if not let me know!
Thanks!
Phil
Hello @Maskell_Rascal
Thank you for your quick response!
The flow did work for a part of my solution.
However, I need to extract tables from a pdf file to excel.
I have attached a sample data pdf file and the desired output excel file.
The pdf contains 2 tables side by side on each page.
I need to transpose the 2nd table columns below the 1st table page wise.
This is proving to be quite challenging when I do it for multiple pages.
Can you please tell me how to do that?
It would be a great help!
Thank you
Pankhudri
User | Count |
---|---|
17 | |
15 | |
15 | |
8 | |
6 |