Community Spring Cleaning week is here! Join your fellow Maveryx in digging through your old posts and marking comments on them as solved. Learn more here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Filtering records with English and non-English characters in a single field

hamdylie
7 - Meteor

I am a newbie. I have records of first and last names that may contain English & non-English characters in a single cell, e.g. Japanese or Chinese characters with with Latin alphabets. How can I filter these records? Thanks.

4 REPLIES 4
s_pichaipillai
12 - Quasar

@hamdylie

 

there is a filter tool available under preparation 

you can use it for filtering your data

 

if this does not help you please post some sample data , we can help you 

 

Thanks

Saravanan

jdunkerley79
ACE Emeritus
ACE Emeritus

Assuming you only allow English alphabet characters then use a Filter Tool with a Custom Expression:

 

REGEX_Match([FirstName], "^[A-Za-z]$")
&&
REGEX_Match([LastName], "^[A-Za-z]$")

The True output would be those names with only English characters

 

Hope it helps

Shaaz
9 - Comet

Hi Sarav,

Can you please let me know how to separate English and Non English characters from a dataset. Below is the sample dataset.

 

Phrase                           Value

 

你好吗                            1000

好不好                            20000

这是什么?                       4500

Good or Bad                   876899

Old                                  6781

ABCD@_$%&123           67899

 

Thanks in Advance        

 

Shaaz
9 - Comet

It can be done using Regex_Match in Filter Tool. Use the below code.

 

REGEX_Match([Field 1],"[^\x00-\x7F]+")

 

True will give all Non English Characters.

False will give English Characters.

Labels