We have a situation where we are trying to turn PDFs of bank statements into Excel Files. The structure of the file is such that there is header information (date, acct number, page #, etc..), then sections with a title row such as "Deposits and Additions", "Checks and Withdrawals", "Checks in Serial Number Order", and "Daily Balance Info". Each section is structured into nicely laid out columns with headers, but the sections have different column layouts. The number of rows in each section are variable based on the number of transactions. I have been able to turn the PDFs into Text files. But where I am getting stuck is trying to extract the different sections and then combining into a table. Has anyone been able to do something similar where you search through a text file for a phrase and then grab a variable amount of rows that occur before the next phrase? If needed I can put a sample file up for review.
Thanks in advance.