Hi Community,
Data-->emp1-->expense_report1-->bill1
bill2
bill3
expense_report2-->bill34
23
emp2-->expensereport_344
expensereport_454
emp3-->expensereport_345
above is the structure how bills and invoices are present in a folder for each employees.
i have to extract text from the images, pdf and then compare all bills for a particular employees with each other to find duplicacy.
problem i am facing is when i am using image input and image to text tool it is giving the some memory error and unable to extract the text.(there are around 2800 bills)
what approach should i use to make this workflow?