Alteryx Designer Desktop Discussions

SOLVED

Help Needed: Formatting PDF Text to Table Using RegEx

buddhiDB
7 - Meteor

Hi Community,

I've extracted several PDFs using the "PDF to Text" tool with the "Lines" method. The extracted result is attached.

I attempted to format the output into a table using RegEx to match the structure shown in the attached image. However, I'm not getting the expected results.

I've also attached the workflow I used for testing. I'd really appreciate it if anyone could help me resolve this issue.

4 REPLIES
aatalai
15 - Aurora

@buddhiDB I brought the image in again and used table extraction (which might also help at the PDF level), giving it a template. This might be a nice workaround. I've also attached a workflow; please let me know how you get on.

Screenshot 2025-07-10 082223.png

buddhiDB
7 - Meteor

@aatalai Thank you so much for your quick response and the work you’ve done for me.

In my case, I have many PDFs, and each PDF contains this specific table on the second page. I need to extract that table from each file. Without relying on images, is there any way to extract the table specifically from the second page using the image template as a reference?

aatalai
15 - Aurora

@buddhiDB Use the PDF to Text tool and set the template that way. You can also filter to look only at the second page of your input. I've attached an example from Alteryx.

It's hard to give a clear example without data.
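
For what it's worth, here is a rough sketch in Python (not an Alteryx workflow) of the same idea: keep only the page 2 lines, then split each line into columns. The sample rows, the `page_lines_to_table` name, and the assumption that columns are separated by two or more spaces are all made up for illustration; the real PDFs may well need a different pattern.

```python
import re

# Hypothetical rows: (page number, line text), roughly what a
# "Lines"-style extraction might yield. Not taken from the real PDFs.
rows = [
    (1, "Cover page text"),
    (2, "Item    Qty    Price"),
    (2, "Widget    2    9.50"),
    (2, "Gadget    10    3.25"),
]

def page_lines_to_table(rows, page=2):
    """Keep lines from one page and split each on runs of 2+ whitespace chars."""
    table = []
    for page_no, text in rows:
        if page_no != page:
            continue  # skip lines from every other page
        # Assumption: columns are separated by at least two spaces.
        cells = re.split(r"\s{2,}", text.strip())
        table.append(cells)
    return table

table = page_lines_to_table(rows)
header, body = table[0], table[1:]
```

If the column gaps are not consistent, a pattern that names each field explicitly would probably be more reliable than splitting on whitespace.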

Screenshot 2025-07-10 094505.png

Screenshot 2025-07-10 094318.png

buddhiDB
7 - Meteor

Hi @aatalai ,

Could you please help me format the PDF extraction results in the attached Excel file?

  • The "Table Method" tab contains results extracted using the Table method.

  • The "Line Method" tab contains results extracted using the Lines method.

  • For your reference, I’ve also included table snips from each PDF in the same attachment.

  • The "Expected Result" tab shows how we would like the final output to look.

Additionally, I’ve attached a full snip of page 2 from one of the PDFs so you can see the table layout clearly.
