Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Community is experiencing an influx of spam. As we work toward a solution, please use the 'Notify Moderator' option on the ellipsis menu to flag inappropriate posts.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

TSV file 4GB

MZ900605
8 - Asteroid

Hello guys I need help with reading and editing a tsv file, It is a chemical file in the pictures there is a sample of the data and the error am getting any help?

Screenshot (264).png

 

 

 

Error: Input Data (9): Error reading "C:\Users\wolve\Desktop\BindingDB_All.tsv": Too many fields in record #1013457

 

Screenshot (263).png

4 REPLIES 4
mceleavey
17 - Castor
17 - Castor

Hi @MZ900605 ,

 

This is usually due to the delimiter being present in a cell in a single row.

If you can post the data we can build it for you, but if you want to have a crack, try opening it with a \0 delimiter and then parsing it manually with text to columns.

 

M.



Bulien

MZ900605
8 - Asteroid

@mceleavey 

I have almost 49 columns, \0 delimiter will only show the first couple of tabs merged all in a row!

Screenshot (265).png

mceleavey
17 - Castor
17 - Castor

Correct.

You then manually parse them out by whichever delimiter the file uses, or indeed by the number of characters in a fixed width file.

 

The problem you're having is that in the row that has the error, the format is broken, so it can't split it into the same number of columns. 

That means it can't auto parse it into columns.

 

When you have corrupt data, you need to address it manually.

 

M.



Bulien

mceleavey
17 - Castor
17 - Castor

@MZ900605 ,

 

Simply apply a text to columns tool to the single column and use \t as the delimiter.

 

I've attached the following example:

 

mceleavey_0-1636648858607.png

 

mceleavey_1-1636648877159.png

 

mceleavey_2-1636648896941.png

 

mceleavey_3-1636648907850.png

 

You will probably then need to fix that broken row, but it will process and allow you to sort that out after the fact.

 

M.

 

 



Bulien

Labels