All
I am using the image input tool to read the text from pages of a PDF file, and then reg-ex to locate employee numbers on the pages



However some of the pages are not reading fully so when the data is turned to text, the employee numbers are missing (see nulls below)


Ive examined the full process and the problem seems to be at the image to text phase - the Employee numbers are present on all pdf pages, however some are not making it to the text output.
I think possibly the image to text step is not taking in the full page, maybe due to its dimensions
Is anyone familiar with this problem? Is there a way to set the size of the image that the Image to Text step pulls in?
Thanks in advance
Dave