Want to get involved? We're always looking for ideas and content for Weekly Challenges.
SUBMIT YOUR IDEAHi Community members,
A solution to last week’s challenge can be found here.
This challenge was submitted by James Bevan, @JBevan89 . Thank you, James for your submission!
For this week’s challenge, we will be working with the TMDB Movie Metadata dataset from Kaggle, a rich collection of information about modern films, including revenues, budgets, and production details.
Your role as a data analyst is to dig into the data and uncover insights by answering the following questions:
Once you have completed your challenge, include your solution file and a screenshot of your workflow as attachments to your comment.
The Academy Team
For Task 1, there are a few ties so I did my best to apply some sorts to make this mostly match the desired output. For Task 3, my answer ended up being one less than the provided solution... wondering if perhaps it counted the movies that didn't have a listed spoken language as a separate language??? (I didn't count that in mine).
@Hub119
You beat on by speed. 😁
I came to the same observation as you.
For Q1, there are many ties, around 10th rank.
For Q3, I would argue that we should use the ISO 639-1 Language Code, rather that the Language name, since the column contains unicode.
then I came up with 87 distinct languages.
@alineruizcampos
Thank you for agreeing with me.
I checked our result and feel maybe the JSON Parse may have failed on row#2087, where the name is empty.
That is the only difference between our result for Q3.
@Qiu okay, glad I'm not crazy... I was also getting that count of 80+ records when looking at unique 2 letter language codes. I switched to pulling the listed name in order to try and match the provided solution.
@Hub119
Given what you have done in Advent of Code , its very difficult for me to be convinced that you are not crazy. 😁
@Qiu totally fair statement 🤣
My answer did not match, just like everyone else's.
Fun challenge. Like others, my answer to part 3 didn't quite match, but I got the other parts correct.