Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Community is experiencing an influx of spam. As we work toward a solution, please use the 'Notify Moderator' option on the ellipsis menu to flag inappropriate posts.
Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Regex Remove HTML Containers

Afammy
7 - Meteor

My input includes html containers and I need to remove them (characters and html commands) and leave just the actual text values:

 

<div>How do I remove the html tags?</div><br><div>Looking forward to the answer.</div>

 

Need to see: How do I remove the html tags? Looking forward to the answer.

 

Thank you.

6 REPLIES 6
MarqueeCrew
20 - Arcturus
20 - Arcturus

@Afammy,

 

I took a stab at this for you using your sample container entry.

 

Screen Shot 2017-02-01 at 11.39.38 AM.png

 

I added a space between the text.  You might want to have a PIPE delimiter left behind instead of the space.  Please try this and let me know if it helps.

 

Trim(
          Regex_Replace(
                                    Regex_Replace([HTML_Container],"<.*?>",'|'), "[|]{1,}",
                                                              ' ')
                                    )
Trim(
          Regex_Replace(
                                    Regex_Replace([HTML_Container],"<.*?>",'|'), "[|]{1,}",
                                                              '|')
                                    )

I've got the two sets of code (space and pipe replacements) above.  

 

Trim() gets rid of any spaces at the front or end of your field.

Regex_Replace uses a wild card for all sets of data enclosed by <> and replaces them with a '|' pipe.

Regex_Replace uses a search for  multiple pipes and replaces them with a single pipe.

 

At least for your test data it seems to work.

 

Cheers,

Mark

Alteryx ACE & Top Community Contributor

Chaos reigns within. Repent, reflect and restart. Order shall return.
Please Subscribe to my youTube channel.
Afammy
7 - Meteor

Thanks @MarqueeCrew! Worked like a charm. You also gave me a mix of formula and regex to learn and use moving forward. 

halfling
5 - Atom

Thanks!  This was just what I needed to solve my problem.

wale_ilori
9 - Comet

Thank you!! Just what I needed right now.

AshimaPahwa
5 - Atom

Thankyou so much for the solution.

 

sdodero
6 - Meteoroid

Thanks Amazing!

Labels
Top Solution Authors