Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Cleaning data

Esha_Reddy
5 - Atom

Hello,

 

I'm having trouble cleaning this data to as shown below (2nd table),

 

Entity;"Alternative Name";"Story";"Year";"Records Lost";"Sector";"Method of Leak";"1st source";"2nd source";"3rd source";"Source name"
River City Media;"Spam operator";"A dodgy backup has allegedly resulted in over a billion leaked email addresses

 

Could someone please help me get through cleaning this kind of dirty data.?

 

Also attached is the file I'm trying to clean.

 

Output should look like,

EntityAlternative NameStoryYearRecords LostSectorMethod of Leak1st source2nd source3rd sourceSource Name
River City MediaSpam OperatorA dodgy backup has allegedly resulted in over a million leaked email addresses20171,37e+09WebAccidentally Publishedhttps://betanews.com/2017/03/06/river-city-media-spam-database-leak/  Beta News

 

 

Thanks in advance.

2 REPLIES 2
Aguisande
15 - Aurora
15 - Aurora

Hi @Esha_Reddy 

 

What I'd do is to grab your input data, and read it as a csv, using "\0" as delimiter and untick "First Row Contains Filed Names".

Once the 278 records (including the headers) are into Alteryx, just use a Cleanse Tool to get rid of the punctuation in the values (If you want, you can keep Field 8 out this cleaning process, since all ":", "/" from the urls are going to be removed). You can remove the quotes of this field with the Formula tool later (ReplaceChar([Field],"Char to find", "Char to replace with")

Next, use a Text to Columns tool to separate the 11 columns using ";"as the delimiter.

 

Finally, with the Dynamic Rename Tool, connect your data to the top input and select Take Field Names from the First Row of Data.

 

Probably, your data may need further cleaning, but this will help you start. I'm attaching the previous steps in a WF to give you a better idea of the whole process.

Best,

_AG_

PS: As a best practice, use a Select Tool after the Text to Columns, and get rid of the Original field.

aish49
5 - Atom

Hi ! Please find the workflow in the attachment. Hope it solves the problem!

 

Solution.png

 

Labels