I need to create an index/concordance from a PDF file. Can Alteryx do this?
I have a PDF file that has been OCR'd. I want to create an index or concordance (I'm not sure which term fits best) that lists keywords and their locations within the PDF file. Think of the index at the end of a textbook that shows every location of the word "cardinal," "bluejay," or "oriole." A page number for each location would be acceptable; ideally, though, instead of a page number, the output would show the chapter, section, and paragraph in which the word appears.
Example:
cardinal--Bird Chapter, Red Bird Section, Paragraph 1
bluejay--Bird Chapter, Blue Bird Section, Paragraph 4
oriole--Bird Chapter, Orange Bird Section, Paragraph 2
--Bird Chapter, Black Bird Section, Paragraph 3
Any suggestions?
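In case it helps clarify the goal, here is a rough Python sketch of the output I'm after. It is not an Alteryx workflow, and it assumes the text has already been extracted from the PDF and split into chapters, sections, and paragraphs (that step is not shown). The function names and the sample data are made up for illustration.

```python
import re
from collections import defaultdict


def build_concordance(sections, keywords):
    """Map each keyword to the (chapter, section, paragraph number) spots where it occurs.

    `sections` is a hypothetical structure: a list of
    (chapter, section, [paragraph texts]) tuples.
    """
    # Whole-word, case-insensitive match for each keyword.
    patterns = {
        kw: re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        for kw in keywords
    }
    index = defaultdict(list)
    for chapter, section, paragraphs in sections:
        # Paragraphs are numbered from 1 within each section.
        for number, text in enumerate(paragraphs, start=1):
            for kw, pattern in patterns.items():
                if pattern.search(text):
                    index[kw].append((chapter, section, number))
    return dict(index)


def format_concordance(index):
    """Render the index; repeat locations for a keyword get a blank label."""
    lines = []
    for kw in sorted(index):
        for i, (chapter, section, number) in enumerate(index[kw]):
            label = kw if i == 0 else ""
            lines.append(
                f"{label}--{chapter} Chapter, {section} Section, Paragraph {number}"
            )
    return "\n".join(lines)


# Made-up sample data, only to show the shape of the input.
sample = [
    ("Bird", "Red Bird", ["The cardinal is bright red."]),
    ("Bird", "Orange Bird", ["Orange birds are common.", "The oriole sings."]),
]
print(format_concordance(build_concordance(sample, ["cardinal", "oriole"])))
```

Page numbers could be used in place of the chapter/section/paragraph tuple if that is all the PDF extraction can provide.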