Reading multiple pdf files with diffrent formats
Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
MAYANK_F
5 - Atom
08-14-2024
06:07 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi everyone i want to build an iterative macro to input multiple pdf files with inconsistent format and then output certain records that are common in that but i am not able to get a starting point around this use case.
Labels:
- Labels:
- Iterative Macro
- Macros
1 REPLY 1
griffinwelsh
12 - Quasar
08-14-2024
10:07 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@MAYANK_F This depends on how inconsistent your data is. If the files are sometimes image based text you will need to use OCR otherwise you can just fetch the text. Either way you need to use Python with a package like pymupdf or the computer vision tools from intelligence suite.

