This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
I have a pdf file that has tabular data with 5 columns. I have used python tool and converted the pdf file to text. Now I got all the columns and its data in only one single column one followed by other...I have to split this row into individual columns and load in into a table. There are no delimiters. The data in pdf looks similar to the one attached.
If that that's the case and all 6 columns are included that would give you 34 original rows = 234 records, incl the column names. Your data appears to missing a value. Probably a border condition missed by the python script
The attached workflow. Takes the input data, with the embedded headers
calculates how many rows there should be based on 6 columns and performs a crosstab
Thanks this helped me to some extent. I could convert the first 2 columns but for 3,4,5,6 columns I have all the 4 column names one below the other and then the column values followed. After the column names 4th column values started. As the 3rd column has only 2 values that came up at very last rows.