Alteryx Designer Desktop Discussions

geeman · ‎07-26-2018

Hi,

I'm looking for an approach to read all the content inside an HTML tag with an ID that I derive dynamically in the workflow.

E.g.

<HTML>

...

</DIV>

<DIV id="section-2" class="someclass" > ...

</DIV>

...

</HTML>

I'm looking for a solution similar to the regex <div class=\"someclass\" id=\"' + [DIV_ID] + '\" .*?>.*?<\/div> (which is not working) that generates this output:

---

</DIV>

---

Eventually, the goal is to extract and list the <table> contents.

Could anyone provide some suggestions? Thank you in advance!
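
Editor's sketch: two things in the pattern above may explain why it does not match, judging only from the example in the question. The sample tag lists id before class while the pattern expects class first, and `.` typically does not cross line breaks unless dot-all mode is enabled. The Python below is an illustration under those assumptions, not the Alteryx solution; `div_block_pattern` and the sample string are hypothetical.

```python
import re

def div_block_pattern(div_id):
    """Build a pattern for a <div> with the given id, in any attribute order."""
    # re.escape keeps characters in the id from being read as regex syntax.
    return re.compile(
        r'<div\b[^>]*\sid="' + re.escape(div_id) + r'"[^>]*>(.*?)</div>',
        re.IGNORECASE | re.DOTALL,  # match <DIV> too, and let . span newlines
    )

# Hypothetical sample, modelled on the snippet in the question.
sample = (
    '<DIV id="section-2" class="someclass" >\n'
    '<table><tr><td>x</td></tr></table>\n'
    '</DIV>'
)

match = div_block_pattern("section-2").search(sample)
inner = match.group(1).strip() if match else ""
print(inner)
```

The lazy `(.*?)</div>` stops at the first closing tag, so this only holds when the target div contains no nested divs.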

BenMoss · ‎07-26-2018

Do you have some sample data, @geeman?

Ben

CharlieS · ‎07-26-2018

If it's all on the same line/record, then you could use the Substring function. Here's an example:

Substring([Field1],findstring([Field1],"<table>")+7,findstring([Field1],"</table>")-(findstring([Field1],"<table>")+7))

Otherwise, like @BenMoss said: an example input file for us to use would be best.
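
Editor's sketch: to make the offset arithmetic in that formula concrete, here is a rough Python equivalent. Python is used only for illustration, and `between_table_tags` is a hypothetical helper, not something from the thread. FindString and Python's `str.find` both return 0-based positions, and 7 is the length of "<table>". Unlike the formula, this version looks for the closing tag only after the opening one and returns an empty string when either tag is missing.

```python
def between_table_tags(field):
    """Return the text between the first <table> and the next </table>."""
    open_tag = "<table>"
    close_tag = "</table>"

    start = field.find(open_tag)
    if start == -1:
        return ""              # no opening tag in this record
    start += len(open_tag)     # same role as the "+7" in the formula

    end = field.find(close_tag, start)
    if end == -1:
        return ""              # opening tag without a closing tag
    return field[start:end]

print(between_table_tags("<p><table><tr><td>1</td></tr></table></p>"))
```

Like the formula, this matches the literal string "<table>" only, so a tag carrying attributes (for example a class) would not be found.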

geeman · ‎07-26-2018

Thanks for the replies, @CharlieS & @BenMoss! I have attached a sample file for your reference. The actual HTML file is very large and grows dynamically as Ajax calls are made to load additional data into the tables.

CharlieS · ‎07-26-2018

I'm not sure exactly what you're looking for as far as table parsing goes, but I've attached a solution that will isolate the table contents with the section name appended.

geeman · ‎07-26-2018

Hi @CharlieS, this is close to what I'm looking for. The only additional ask is to be able to pass the HTML element ID (the DIV id, in this case) to the Multi-Row Formula dynamically, as a variable/parameter. Thank you so much for your help!

CharlieS · ‎07-26-2018

How about a Join as a filter?

geeman · ‎07-26-2018

@CharlieS, thank you, I appreciate your help! I actually thought about putting a Filter on the specific 'Sections', and it works much like the Join you suggested.

But again, is it possible to pass a parameter into the Multi-Row Formula? The table content needs to come only from section-1; your solution returns the table content for both section-1 and section-2. That is why I was looking to get only the section-1 block. Any suggestions?
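
Editor's sketch: the thread ends without an answer to this last question. Outside of the Multi-Row Formula, one way to restrict table extraction to a single div chosen at run time is to walk the markup with a parser and track div nesting. The Python below uses the standard library's `html.parser`; the class name, the `table_rows` helper, and the sample markup are all hypothetical, and this is not the workflow attached earlier in the thread.

```python
from html.parser import HTMLParser

class TableRowsInDiv(HTMLParser):
    """Collect table rows (lists of cell text) found inside one <div> id."""

    def __init__(self, div_id):
        super().__init__()
        self.div_id = div_id
        self.div_depth = 0   # 0 means we are outside the target div
        self.rows = []
        self.row = None      # cells of the row being read, if any
        self.cell = None     # text fragments of the cell being read, if any

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <DIV> works too.
        if tag == "div":
            if self.div_depth:
                self.div_depth += 1          # nested div inside the target
            elif dict(attrs).get("id") == self.div_id:
                self.div_depth = 1           # entering the target div
        elif self.div_depth:
            if tag == "tr":
                self.row = []
            elif tag in ("td", "th") and self.row is not None:
                self.cell = []

    def handle_endtag(self, tag):
        if not self.div_depth:
            return
        if tag == "div":
            self.div_depth -= 1
        elif tag in ("td", "th") and self.cell is not None:
            self.row.append("".join(self.cell).strip())
            self.cell = None
        elif tag == "tr" and self.row is not None:
            self.rows.append(self.row)
            self.row = None
            self.cell = None

    def handle_data(self, data):
        if self.div_depth and self.cell is not None:
            self.cell.append(data)

def table_rows(html, div_id):
    parser = TableRowsInDiv(div_id)
    parser.feed(html)
    parser.close()
    return parser.rows

# Hypothetical sample: two sections, the first with a nested div.
sample = (
    '<HTML><DIV id="section-1" class="someclass"><DIV><table>'
    '<tr><th>a</th><th>b</th></tr><tr><td>1</td><td>2</td></tr>'
    '</table></DIV></DIV>'
    '<DIV id="section-2" class="someclass"><table>'
    '<tr><td>9</td></tr></table></DIV></HTML>'
)

print(table_rows(sample, "section-1"))
```

The div ID is an ordinary argument here, so it can come from wherever it is derived; the depth counter is what keeps a nested closing div from ending the section early.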

Parse all content between HTML tags using the element ID
