Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Community is experiencing an influx of spam. As we work toward a solution, please use the 'Notify Moderator' option on the ellipsis menu to flag inappropriate posts.

Weekly Challenges

Solve the challenge, share your solution and summit the ranks of our Community!

Also available in | Français | Português | Español | 日本語
IDEAS WANTED

Want to get involved? We're always looking for ideas and content for Weekly Challenges.

SUBMIT YOUR IDEA

Challenge #40: Parsing a HTML File

danilang
19 - Altair
19 - Altair

Tricksy

 

Spoiler
Solution 40.png

Dan

CaraI
Alteryx
Alteryx

Fun and challenging!  

 

Spoiler
40 workflow.jpg
OldDogNewTricks
10 - Fireball

This one was tough for me, but I got there in the end.

 

Time to check out other solutions to see how I could have done it better!

 

Spoiler
My results - concatenated Doctor's with multiple specialties (yellow highlight) and stripped out unnecessary white space (peach highlight)
challenge_40_completedScreen_Mine.jpg

Solution provided results:
challenge_40_completedScreen_Solution.jpg
kat
12 - Quasar

Felt like a bit of a hack, curious to see what the solution is!

 

Spoiler
Challenge #40.PNG
danrh
13 - Pulsar

I'm not matching on everything, but when looking at the expected solution vs mine, I think I'm possibly more accurate in some instances?

 

 

Spoiler
For example, the solution provided has Alan Bielsky's practice as 'Anesthesiology', but it looks like in the html it's actually 'Pediatric Anesthesiology'...  Other examples as well, not necessarily the same type of issue, but for the handful that I compared I'm comfy with my solution :)
image.png

 

OllieClarke
15 - Aurora
15 - Aurora

<3 regex

Spoiler
Challenge 40.PNG
Kenda
16 - Nebula
16 - Nebula
Spoiler
Here is my solution. As @brianprestidge pointed out, I think the output provided in the start file is incorrect on line 649. Other than that, matches exactly!

40.PNG
KOBoyle
11 - Bolide

Solution attached. Similar to @Joe_Mako, I included all Practices (multiple <h5> tags) for each physician.

 

Spoiler
challenge_40_spoiler_KO.png
MarMu
8 - Asteroid

 

Fun to muck around with HTML web scraping.

I spend some time realizing that record 649 is corrupt as also suggested by a few posts.

 

Probably not the most smooth solution out there, but it works :)

 

Spoiler
challenge_40_start_file_MarMu.png
JoBen
11 - Bolide

Cheers!