Help with better workflow including more data in output

Question

Hi,

I have been asking bits and parts to a PDF parsing solution and data to be included in the output. And I credit the success of this journey to you all

@Qiu @apathetichell @Ben_H @PhilipMannering @pedrodrfaria Thank you! Your help has been invaluable!

And I have managed to get the output close to what the demands are. However, I have further requests which I am hoping to get help as I am totally out of my depth here.

The output needs to be further broken down as per the "Account" i.e. RecordID (Tool ID: 75):  63, 78, 124,  135, 163, etc.

i.e. the filter output of Tool ID: 91

I need to bring this information as a part of my output. Probably a good brain-picking exercise if you guys call it.
Can you help?
If you have any questions, please ask and I will be prompt to respond.

Thanks Again!
Please find workflow attached.

Workflow.yxmd

HW1 · Answer

@Qiu  Sure!
Please find PDF attached and the workflow that has the PDF parser which is used for parsing the pdf

Settings for the parsing.

The PDF used in this example is attached below.

The data ideally expected is in the format in the PDF and is very close to the output of the workflow you have helped me build.

I will also try to work this on my end. The PDF from this account are in a standard format and does not change. The output will be used as a .tsv format which will  be uploaded into an accounting system.

Thank you so much!

Please let me know if you have further questions.

JJ Richards SA - Multiple Sites - 270221 - 300083742102              x.pdf

Qiu · Answer

@HW1

The text mining is always challenging given usually unstructured data.
But would it be possible for you to provide a original PDF pages and sample output format?

Lets give a try.