Alteryx Designer Desktop Discussions

armouche · ‎06-07-2024

Hello.

I have a long text containing several instances of the word "credible". For example: sentence 1. sentence2 that contains the word credible. sentence 3. sentence4 that contains the word credible. sentence5 that contains the word credible. sentence6. sentence7. etc,,,,

I want to be able to output a new column that contains 30 words or 100 characters before the word "credible"+ the word credible+ 30 words or 100 characters after the word "credible". In the example above, I will have 3 columns.

I tried RegEx with the expression credible\s+\w+ but getting only the word credible+ 1 word after. The setting is that the output tokenize 20 columns. I am new to RegEx.

Appreciate your help.

PhilipMannering · ‎06-10-2024

To get part of the way there, you might try replacing credible with "|credible|",

replace([my_field], 'Credible', '|credible|')

And then use a text to columns.

Feel free to send an example of the input and output you expect if you're having difficulties thereafter.

armouche · ‎06-12-2024

Thank you. Partially worked with replace. I ended up using (?:\S+\s+){0,30}\bcredible\b(?:\s+\S+){0,30} and that worked much better. Thank you all.

Vampire
Zombie
Ghost
Pumpkin

Alteryx Designer Desktop Discussions

Extract N words in a string before and after a certain word