Hi,
The Excel file that I'm working with is converted from PDF, so the format is really off for certain lines. I'm trying to match up the columns by concatenating everything in Excel and then using text-to-columns in Alteryx. However, I have some columns that are not filled for every entry (Ex: A and F), which can lead to column mismatches when I concatenate and then text-to-columns. I'm thinking of using RegEx to take out the 4 in A2 and 113121 in F3, but I'm not sure what the right equation might be.
The first picture is the raw data imported from PDF, the second picture is what I'll be uploading to Alteryx. If anyone has a better way to clean up the data, please let me know too!

