We are celebrating the 10-year anniversary of the Alteryx Community! Learn more and join in on the fun here.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Replacing Special Characters

Adityasa
7 - Meteor

Hi,

 

I have a column with some addresses, and they have special characters like in the attached file.


I believe these are Turkish names. Can someone help to change these to normal English characters, as I have plenty of these?

 

Thanks.

3 REPLIES 3
Luke_C
17 - Castor
17 - Castor

Hi @Adityasa 

 

Try using the DecomposeUnicodeForMatch() function.  This should convert the characters. To do multiple columns at once consider using a multi field formula tool. The function would look something like this:

 

DecomposeUnicodeForMatch(_CurrentField_)

Adityasa
7 - Meteor

Hi @Luke_C ,

 

These special characters are getting converted to x001a. I don't think that's correct.

 

Thanks.

danilang
19 - Altair
19 - Altair

Hi @Adityasa 

 

The _x001A_ character strings are actually in the excel file.  You can see this if you change the extension from .xslx to .zip and then look in \xl\sharedString.xml.

 

danilang_0-1618919388653.png

 

In unicode, _x001a_ is known as a substitution character and is used when the under lying byte string has "malformed data, when trying to interpret data as characters" See here for more info.  The _x001a_ code was added to the excel file in place of the malformed data when the excel file was created.   Since the code can represent any combination of bytes, there's no way to extract the original text.  You'll need to go back to the source of excel file and see if they can provide cleaner data.

 

 

Dan 

  

Labels
Top Solution Authors