Hello,
I have an excel file, which i converted from PDF, but during the conversion, all the column merged into one row and data appears in one row, how i can sperate all.
Not gonna lie, that conversion must have done an awful job - the whole data structure is whack.
Is there any other outputs you can use, maybe .TXT? Then you can delimit it better. Right now, even with Alteryx, you may need to split the file into three, then from there, each data row needs to be parsed - but problem is, the names, address etc. each have a non-standard way of delimiting, thereby making it very difficult to do.
Hi, @bilamalik
Your PDF is too large, with over 1000 pages! My suggestion is to convert to docx or xlsx format through other methods like python(in fact your pdf created by word2019), which will reduce the difficulty of processing.