In case you missed the announcement: The Alteryx One Fall Release is here! Learn more about the new features and capabilities here
ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests after December 31, 2025. Set up your security questions now so you can recover your account anytime, just log out and back in to get started. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Regex to remove unicode for fuzzy match

maleryx
7 - Meteor

I am currently using REGEX_Replace(_CurrrentField_,'[^\w]', '') for all text fields. is there anyway to prevent unicode characters were not convertable?

 

ConvError: Fuzzy Match (29): Comments: Some Unicode characters were not convertable ("therrror ÃÂÃâšÂÃâÅthisisjustasamplesentencebecausethisisasamplesentecetoreplicable the error")

4 REPLIES 4
afv2688
16 - Nebula
16 - Nebula

Hello @maleryx,

 

You could try using this formula:

 

DecomposeUnicodeForMatch([Field1])

 

This converts your characters.

 

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Regards

maleryx
7 - Meteor

ConvError: Multi-Field Formula (42): DecomposeUnicodeForMatch: Some Unicode characters were not convertable ("xxxÃââœ199xx1")

ConvError: Multi-Field Formula (42): DecomposeUnicodeForMatch: Some Unicode characters were not convertable ("xαxxxxxx_x48xxx22142")

afv2688
16 - Nebula
16 - Nebula

Hello @maleryx,

 

The tool I showed you gets rid of most of the characters. There are som which cannot be converted like

 

ƒ

OR

œ

 

Those you either get rid of them with a regex replace tool or there is little much (as far as I know) to do about them

 

Regards

maleryx
7 - Meteor

Hello @afv2688

 

I am trying to ConvertToCodePage([_CurrentField_], 65001). will let you know the outcome.

Labels
Top Solution Authors