According to the documentation of the RegEx tool (and general practice AFAIK) '\w' in RegEx should be a short hand for [A-Za-z0-9_]. That is, all uppercase and lowercase unaccented latin letters, numbers and underscore

However in versions 2023.1+ (at least) \w allows any character from any alphabet.

Is this expected behaviour? Has this always been the case? Is there a setting I'm missing?
Thanks,
Ollie