I have a data set of about 6 to 7 million lines, and I can't get it to import correctly: when I add the file to Alteryx, the values are not distributed correctly across the fields.
Every field is enclosed in quotation marks ("Data1") and delimited by semicolons (;), but some fields have semicolons inside the value, e.g. ("0,0225200";"JÉSSICA SANTOS";"32877077870") or ("DUQUE DE CAXIAS";"AV; NILO PECANHA";"1642").
Every time I try to import, some of the rows come out as null.
Thanks in advance.
It would appear that the sample file you attached read in correctly. Does this not look as you would expect?
The #9 option in the configuration window allows you to ignore delimiters in quotes, which should take care of your issue.
The sample you provided seems to be assigning the correct fields under the correct columns.
You can change the option in the Data Input to ignore delimiters inside quotes, inside single quotes, or automatically.
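To illustrate what "ignore delimiters in quotes" means outside of Alteryx, here is a minimal Python sketch using the standard `csv` module. It is not how Alteryx works internally; the function name `parse_quoted` is only illustrative, and the two rows are the ones quoted in the question.

```python
import csv
import io

# The two example rows from the question; the second one has a
# semicolon inside a quoted field ("AV; NILO PECANHA").
sample = (
    '"0,0225200";"JÉSSICA SANTOS";"32877077870"\n'
    '"DUQUE DE CAXIAS";"AV; NILO PECANHA";"1642"\n'
)

def parse_quoted(text):
    """Split on ';' but treat anything inside double quotes as one field."""
    reader = csv.reader(io.StringIO(text), delimiter=";", quotechar='"')
    return list(reader)

rows = parse_quoted(sample)

# A naive split ignores the quotes and cuts the address in two.
naive = sample.splitlines()[1].split(";")

print(rows[1])     # three fields, address kept whole
print(len(naive))  # four pieces, address broken at the semicolon
```

A quote-aware parser keeps the second row at three fields, while the naive split produces four, which is the kind of shift that pushes values into the wrong columns.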
Pedro.
Well, the thing is, when I run the script it goes wrong: it doesn't split the fields correctly, as the image shows.
This is how I'm configuring the file:
Felipe,
When you upload the file into Alteryx, is this the configuration you are selecting?
Pedro.
Yes, I'm selecting this option and using the correct delimiter, as in your image.
Felipe, can you please upload the dataset in the image example you just sent? Just to make sure I am looking at exactly the same data you sent in this image.
Pedro, when I run it on the sample that I sent you, it works fine. The issue happens when I try it with the full file, which has over 7 million lines.
This is what I get when I work with the sample; as you can see, there is no issue.
And the image below shows what happens when I run the full file.
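Since the sample reads correctly and only the full file fails, one possible explanation (an assumption, not something confirmed in this thread) is that a few lines in the full file have unbalanced quotes or an unexpected number of fields. A minimal Python sketch for locating such lines is below. The helper name `find_suspect_lines` is hypothetical, and the expected field count of 3 is taken from the examples in the question; the real file may have more columns.

```python
import csv

def find_suspect_lines(lines, expected_fields, delimiter=";", quotechar='"'):
    """Yield (line_number, reason) for lines that look malformed."""
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        # An odd number of quote characters means a quote is never closed.
        if line.count(quotechar) % 2 != 0:
            yield number, "unbalanced quotes"
            continue
        # Parse the single line with quote-aware splitting and count fields.
        fields = next(csv.reader([line], delimiter=delimiter, quotechar=quotechar), [])
        if len(fields) != expected_fields:
            yield number, f"{len(fields)} fields"

# Small in-memory demonstration; a file handle can be passed the same way.
demo_lines = [
    '"DUQUE DE CAXIAS";"AV; NILO PECANHA";"1642"\n',  # well formed
    '"a";"b;"c";"d"\n',                                # stray quote
    '"a";"b"\n',                                       # too few fields
]
suspects = list(find_suspect_lines(demo_lines, expected_fields=3))
print(suspects)
```

Because the function reads one line at a time, it can be fed an open file object for the 7-million-line file without loading it all into memory. Note that it treats each physical line as one record, so a legitimately quoted field that spans several lines would also be reported as unbalanced.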