Hi all,
I need to build a dashboard on statistics that come from very "dirty" data.
The data was originally published as a PDF. The picture below gives an idea of the format (many merged cells).
The PDF was converted into a text file using a tool called Tabula. My problem is that Tabula does not handle merged cells well and treats each one as a single cell. The result of the conversion is attached. Instead of the year of application spanning two columns, it spans only one, which means that most columns are labelled with the wrong year or with no year at all.
I'm a beginner at Alteryx, so I was wondering whether there is a way to fix this efficiently that still works when the PDF gets updated.
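To make the goal concrete, here is a minimal sketch in Python (not Alteryx) of what I am after: carry each year label to the right over the blank cells that the merged header left behind, then combine it with the sub-header. The header values and the function name `fill_merged_header` are made up for illustration, and it assumes each year ends up in the first column of its span, which may not match the real Tabula output.

```python
def fill_merged_header(cells):
    """Carry each non-empty header label rightwards over the blank cells after it."""
    filled = []
    last = ""
    for cell in cells:
        label = cell.strip()
        if label:
            last = label  # a new merged span starts here
        filled.append(last)
    return filled


# Hypothetical header rows, as they might look after conversion
year_row = ["Country", "2015", "", "2016", ""]
sub_row = ["", "Applications", "Decisions", "Applications", "Decisions"]

years = fill_merged_header(year_row)
# Combine the year with the sub-header to get one unique name per column
columns = [f"{year} {sub}".strip() for year, sub in zip(years, sub_row)]
print(columns)
```

Because nothing depends on fixed column positions, the same logic should keep working if a later version of the PDF adds more years.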
Thanks!
Negarev
Awesome, thank you!