Alteryx Designer Desktop Discussions

jt_edin · ‎03-05-2020

Try as I might, I can't find the example I need to do this simple regex extract. Please can someone help real quick?

Here is the relevant line of html as it appears in the source of my browser:

Name: J. Robertson

I want to capture the text in bold above (J. Robertson)

So in regex speak I believe I want to search for this text:

Name:

...and then start a marked group, and capture everything until the next instance of which closes the marked group. Pretty simple huh, but I can't figure it out and I can't find a help example. How do I do this? Thanks!

mceleavey · ‎03-05-2020

Hi @jt_edin ,

I've attached the workflow. The first one parses out a single instance of the name, the second is where there are multiple names and it parses out to rows. There's probably a better way of doing it if I had the full HTML, but given what I can see, that should work.

Hope this helps,

M.

fmvizcaino · ‎03-05-2020

Hi @jt_edin ,

Attached is an example showing how to do it.

I'm using tokenize method to get all incidences of that structure.

Best,

Fernando Vizcaino

jt_edin · ‎03-06-2020

Thanks both. I have accepted @fmvizcaino 's solution as it most closely matches the single tool approach I had in mind, however @mceleavey 's is excellent for working through the problem step by step, so thanks.

@fmvizcaino Would you be able to explain what happens within the parentheses of the marked group, both for my benefit and others?

([^<]+)

What do these symbols mean, and where would you recommend we go for help to understand them? I find Regex baffling and I'm sure I'm not the only one!

Alteryx Designer Desktop Discussions

Regex extract text from html using tags

Re: Is there any way the computer vision tools can...

Re: Batch Macro

Re: How to get cell reference address from excel

Re: Replacing Forecast columns with Actual Data

Re: Row creation