Hi,
I have a column with some addresses, and they have special characters like in the attached file.
I believe these are Turkish names. Can someone help to change these to normal English characters, as I have plenty of these?
Thanks.
Hi @Adityasa
Try using the DecomposeUnicodeForMatch() function. This should convert the characters. To do multiple columns at once consider using a multi field formula tool. The function would look something like this:
DecomposeUnicodeForMatch(_CurrentField_)
Hi @Luke_C ,
These special characters are getting converted to x001a. I don't think that's correct.
Thanks.
Hi @Adityasa
The _x001A_ character strings are actually in the excel file. You can see this if you change the extension from .xslx to .zip and then look in \xl\sharedString.xml.
In unicode, _x001a_ is known as a substitution character and is used when the under lying byte string has "malformed data, when trying to interpret data as characters" See here for more info. The _x001a_ code was added to the excel file in place of the malformed data when the excel file was created. Since the code can represent any combination of bytes, there's no way to extract the original text. You'll need to go back to the source of excel file and see if they can provide cleaner data.
Dan