Hi,
I am having some difficulty parsing text (from a PDF). And my AIS PDF Tools are missing (ugh) after updating to 2021.2. It's just a Monday.
Workflow attached. Sample page image below.
A few things...
- I need to remove all the header and footer information. Reference Record IDs 145-149 and 189-192. The first 3 of each group are the header. The last two of each group are the footer (date and page number). These need to be removed from the entire document. With the exception of the page number, the other 4 values are constant.
- I need to pull out the outline. Each outline entry is on its own line and matches (^C\..*?$). [Reference RecordID 162] Subsections are delineated by \([a-z]\). [Reference RecordID 163]
- I then need to group (concat) all the lines of text that correspond with an outline section. [Reference RecordIDs 162-167]
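
In case it helps to see the logic outside of Alteryx, here is a rough Python sketch of the three steps above. It is only an illustration under assumptions: the strings in CONSTANT_LINES and the "Page N" footer format are placeholders (not the real values from the PDF), the function names are made up, and a new group is assumed to start only at a line matching ^C\. (subsections stay inside their parent section).

```python
import re

# Placeholder header/footer values (assumptions): swap in the four constant
# strings that actually repeat on every page of the PDF.
CONSTANT_LINES = {"ACME MANUAL", "Chapter C", "Confidential", "March 2021"}
# Assumed page-number format; the real footer may look different.
PAGE_NUMBER = re.compile(r"^Page \d+$")
# Outline pattern taken from the post: a line starting with "C.".
OUTLINE = re.compile(r"^C\..*?$")


def strip_headers_footers(lines):
    """Drop the constant header/footer lines and the page-number line."""
    kept = []
    for line in lines:
        text = line.strip()
        if text in CONSTANT_LINES or PAGE_NUMBER.match(text):
            continue
        kept.append(text)
    return kept


def group_by_outline(lines):
    """Concatenate each outline line with the text lines that follow it."""
    groups = []
    for text in lines:
        if not text:
            continue  # skip blank lines
        if OUTLINE.match(text) or not groups:
            groups.append([text])  # start a new outline section
        else:
            groups[-1].append(text)  # continuation of the current section
    return [" ".join(group) for group in groups]


if __name__ == "__main__":
    sample = [
        "ACME MANUAL", "Chapter C", "Confidential",
        "C.1 Scope", "(a) This part applies", "to all units.",
        "March 2021", "Page 12",
    ]
    for section in group_by_outline(strip_headers_footers(sample)):
        print(section)
```

To split on subsections as well, the same check could also test for \([a-z]\) at the start of the line. In a workflow, the equivalent would presumably be a filter on the constant values followed by a running group ID and a concatenating summarize, but that mapping is a guess.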
Thanks
