How to separate Latin and non-Latin characters?

Hi everybody,

First of all, sorry for my English.

I have a huge list of people containing many different names, sometimes written like "Arthur" and sometimes like "آرثر" or "亞瑟".

How can I separate these different versions?

I've already tried using a filter with this formula:

REGEX_Match([FirstName], "\w[\w+|\s|-]+")

But I don't know whether Alteryx runs regular expressions with "/u" at the end to match full Unicode or not.

If somebody has an idea, I'd appreciate the help.

Thanks ☺

Accepted answers

This should match the extended Latin Unicode range (U+0080 to U+024F), which includes accented letters:

REGEX_MATCH([FirstName],"^[A-Z \x{80}-\x{24F}-]+$")
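For anyone who wants to sanity-check the character range outside Alteryx, here is a minimal Python sketch of the same idea. It makes a few assumptions: Python's `re` module does not accept the `\x{80}` syntax, so the range is written as `\u0080-\u024F`; `re.IGNORECASE` is used on the assumption that the Alteryx match is case-insensitive; and `is_latin` and `split_names` are hypothetical helper names invented for this illustration.

```python
import re

# Python spelling of the pattern above: \x{80}-\x{24F} becomes \u0080-\u024F.
# The trailing "-" inside the class is a literal hyphen (for names like Jean-Luc).
# re.IGNORECASE is an assumption, so that A-Z also accepts lowercase letters.
LATIN_NAME = re.compile(r"^[A-Z \u0080-\u024F-]+$", re.IGNORECASE)


def is_latin(name: str) -> bool:
    """Return True if every character is a Latin letter, space, or hyphen."""
    return LATIN_NAME.match(name) is not None


def split_names(names):
    """Split names into (Latin, non-Latin) lists, keeping the input order."""
    latin = [n for n in names if is_latin(n)]
    other = [n for n in names if not is_latin(n)]
    return latin, other


# "Jos\u00e9" is "José" with a precomposed e-acute (U+00E9).
names = ["Arthur", "آرثر", "亞瑟", "Jos\u00e9", "Jean-Luc"]
latin, other = split_names(names)
print(latin)
print(other)
```

One caveat: an accented letter stored in decomposed form (a plain letter followed by a combining mark such as U+0301) falls outside this range, so such names would land in the non-Latin group unless the text is normalized first.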

I think this:

REGEX_MATCH([FirstName],"^[A-Z -]+$")

should do what you need.

Thank you for your answer.

It works for separating Latin and non-Latin characters, but unfortunately this expression doesn't match accented characters.