I've got a use case which requires the PDF to Text function and OCR capabilities. Thing is, the file is not standardized due to human writing being involved, which means cursive and unintelligible handwriting sometimes over the printed parts of the file.
End Goal is to parse out certain information from the file - I've done a few and got some results, but I'd say it's about 10% of the full stack...
How would one handle such a use case? Are there any examples out there from Maveryx community?
P.S: Sorry I cannot share the PDFs, they contain sensitive PII information that I cannot disclose. Looking for advice + guidance from the community!