
Alteryx Designer Desktop Discussions

SOLVED

Remove everything outside four quotation marks in one string

mrbubi001
6 - Meteoroid

Hello, my problem is that I can't find a way to remove all the tags outside the quotation marks.

I have the following string:
<a href="https://adsecurity.org/?p=3299">https://adsecurity.org/?p=3299</a><br> <a href="https://attack.mitre.org/mitigations/M1047">[MITRE]Mitre Att&ck - Mitigation - Audit</a>

What I want is only the URLs:

https://adsecurity.org/?p=3299 https://attack.mitre.org/mitigations/M1047

 

Thanks to whoever can help me!

1 REPLY
Jonathan-Sherman
15 - Aurora

Hi @mrbubi001,

 

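You could achieve this with the RegEx tool in Tokenize mode. Because the expression contains a marked group, only the part inside the parentheses (the URL) is output for each match: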

 

 

a href="(.*?)">
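For anyone who wants to check the expression outside Designer, here is a minimal Python sketch. It is not part of the attached workflow and only assumes that Python's `re.findall` behaves comparably to Tokenize mode here: with a single capture group it returns just the captured text. The variable names are made up for illustration.

```python
import re

# Sample string from the question.
text = (
    '<a href="https://adsecurity.org/?p=3299">https://adsecurity.org/?p=3299</a><br> '
    '<a href="https://attack.mitre.org/mitigations/M1047">'
    '[MITRE]Mitre Att&ck - Mitigation - Audit</a>'
)

# Same expression as above. The lazy (.*?) stops at the first '">'
# that follows 'a href="', so each match captures exactly one URL.
pattern = r'a href="(.*?)">'

# With one capture group, findall returns only the captured text.
urls = re.findall(pattern, text)

# Join the URLs with a space to get the single string asked for.
result = " ".join(urls)
print(result)
```

The lazy quantifier matters: a greedy `(.*)` would run to the last `">` in the string and return one long match instead of two URLs.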

 

 


 

I've attached my workflow for you to download if needed!

 

Kind regards,

Jonathan

 
