What is the best way to read PDFs of multiple formats within a single workflow using the same tool? Please provide an accurate and detailed explanation
@Pooja_31 if you have access to Intelligence Suite there's a PDF to Text tool: https://help.alteryx.com/current/en/designer/tools/alteryx-intelligence-suite/computer-vision/pdf-to...
The picture below is from the documentation link above and it shows how you can feed in a list of PDFs to this tool and therefore parse all of them. This is a useful technique when the formatting of the PDFs is not consistent. However, if the formatting is consistent, you can use some of the other Computer Vision intelligence suite tools to do some really cool things!
Any questions, please let me know. If Intelligence Suite isn't an option, you might need to leverage the Python tool to install an OCR python package which can be an alternate solution.