Challenge #116: A Symphony of Parsing Tools!
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
A solution for last week's challenge has been posted here!
The NY Philharmonic, as various ensembles, has been performing for audiences around the world for over 175 years! Wow! This week's Challenge asks you to parse the data for each of their programs (not for the past 175 years...that file was HUGE!) from 2011 - 2017. For each program, identify the concert information (Date, Location, Time, etc), as well as the pieces played during that program and the solo performers (if applicable). Note: the posted solution has removed records representing an intermission.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
I think there is an issue with the output provided, it doesn't match what I see in the XML.
See the Performance Date in the screenshot below:
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hmmm...let me investigate! Thanks for letting me know @mmongeon
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Good catch @mmongeon! I did something silly with my DateTime tool. The start file has been updated with the correct (at least until someone catches something else!) start file.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi,
This could just be my lack, but I am wondering why Program ID 11640 was excluded from the Output when the Performance date landed on 2011-09-07? See image below:
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Here is my solution.
I'm actually getting more data than was provided in the given Output, but, from the spot checks I've performed, I think my additional records are valid.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Another good catch @joe_yang! I've updated the start file again. Also, I think I found a point of discrepancy: I opted to remove the records containing "Intermission" in the column "interval". That means I removed records that @mmongeon's solution includes (and that are included in the original xml file). I'm loving all the intense data investigation this Challenge requires (especially on under-caffeinated Mondays....)!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
My XML parsing revealed that each program could have more that one concert, so my final counts were different from the challenge output.
(e.g. Program 11633 has 3 concert date× and the 3 pieces were played at each concert, so I ended up with 9 records.)
3 concert dates for Program 11633
My output for program 11633
Solution Output for program 11633
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator