Hi All,
I have an interesting problem I hope you might be able to shed some light on -
I have a workflow that separates PDF files into individual files based on employee number. It works fine and is easily adaptable for different file layouts etc.


However I have now been asked to adjust the flow to separate some Canadian year end files. The problem is, each individual employee PDF contains the same information doubled, the page essentially split in half with the data mirrored. This causes a problem when splitting as there are no singular identifiers on the page - everything appears twice.
Is there a way to only take the FIRST instance of a string (say RPC/RRQ) from each page and split on that first instance?
Any help would be appreciated.
Dave