Hi Alteryx Experts,
I've been working on extracting data from a few PDFs into Alteryx, but the extracted data is misaligned across multiple columns. I’ve tried multiple methods, including Transpose, Cross Tab, Multi-Row Formula, and Text to Columns, but I haven’t been able to get the expected result.
Challenges I am Facing:
- Some headers are misplaced, like "1st Inst" sometimes appearing in Column 4 instead of its correct column.
- Some values are shifting across multiple columns after extraction.
- Some values are misaligned due to empty spaces in the PDF structure.
- Some rows (like "pooling") are entirely empty but need to stay in the correct position.
What I Have Tried So Far:
- Filter Tool – Removed unnecessary "Table Header" rows.
- Multi-Row Formula Tool – Tried shifting column values where they are misplaced, but couldn't fully align them.
- Transpose & Cross Tab Tools – Attempted reshaping the data, but misaligned numbers remained.
- Text to Columns Tool – Tried splitting the data correctly but faced inconsistent column placements.
Request for Help
Can anyone guide me on the best approach to correctly format this extracted data in Alteryx? Would really appreciate any suggestions, workflows, or logic to apply!
Thanks in advance!