Weekly Challenges

Solve the challenge, share your solution and summit the ranks of our Community!

Also available in | Français | Português | Español | 日本語
IDEAS WANTED

Want to get involved? We're always looking for ideas and content for Weekly Challenges.

SUBMIT YOUR IDEA

Challenge #13: HTML Table Parsing

EmanueleE
Astéroïde

my solution

 


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

dominiklz
Astéroïde
Spoiler

this was a fun one to wake up in the morning 🙂 


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

adamp
Météore

I'm in need of an assist on this one. I'm trying to work the problem all the way through without checking the solution... Thanks ahead of time for any help :)

 

Révélation
I've chosen to use RegEx to parse out the HTML. I've setup the RegEx tool as follows...

Alteryx103a.png

...and I'm getting tripped up on how to modify the regular expression to drop all that extra text coming into the first record without subsequently loosing text elsewhere...

Alteryx103b.png

Can someone throw me a hint and/or help me through the regular expression here?
1) I'm not 100% sure why the <td></td> at the beginning of Record#1 isn't coming back as its own record... wouldn't the asterik * in the RegEx allow for there being nothing between the <td> and </td> ?
2) I've tried putting a few of the characters from the "<table style="font-family:Arial,Verdana,Times;font-size:12px;text-align:left;width:100%;border-spacing:0px; padding:3px 3px 3px 3px">" section of the string into the exclude portion of my RegEx only they don't seem to work. Probably because I'm not using them correctly or haven't positioned them correctly?  Altered Regular Expression <td>[^ "=].*?</td>


bdaniels
Astéroïde

solution attached


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

CHarrison
Astéroïde

Probably could've cleaned this up a bit better, but it works 😉 


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

locunhalima
Météore

My solution.


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

ivoller
Quasar

That one was fun. My solution was a little clunky until I saw the neat trick from @alex

 


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

joshuaburkhow
ACE Emeritus
ACE Emeritus

A good one! 


This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.

Joshua Burkhow - Alteryx Ace | Global Alteryx Architect @PwC | Blogger @ AlterTricks
dillon_bok
Astéroïde

Definitely not dynamic but here is my solution:

Révélation
Week13.PNG
jsdeter
Astéroïde

This post has been edited by Community Moderation to redact sensitive attachments. The original attachment has been replaced by post_placeholder.txt.