Weekly Challenge

Solve the challenge, share your solution and summit the ranks of our Community!
IDEAS WANTED

We're actively looking for ideas on how to improve Weekly Challenges and would love to hear what you think!

Submit Feedback
We've recently made an accessibility improvement to the community and therefore posts without any content are no longer allowed. Please use the spoiler feature or add a short message in the message body in order to submit your weekly challenge.

Challenge #89: Analyzing Social Data

Highlighted
Alteryx Partner

Simple union to combine all files and a small bit of cleaning the data

Spoiler
Highlighted
Alteryx Partner

summarizing the important tweets by retweeting count..

Spoiler
Highlighted
8 - Asteroid

My Effort. Looked at tweets by day by hashtag, tweets by hour and top 10 tweeters

Spoiler
Highlighted
Alteryx Partner

Trying to avoid the word cloud....

Highlighted
Alteryx Partner

My solution:

Highlighted
8 - Asteroid

Highlighted
8 - Asteroid

I decided to union and aggregate the data by user, how many related tweets they made, the exposure in terms of followers and retweets, and what hashtags were used.

Spoiler
Highlighted
8 - Asteroid

I decided to look at some summary statistics and identify highest users and highest used hashtags. There is an average of 3.8 hashtags per tweet, and the most commonly used hashtag is #SDGs. As it turns out, the top 19 users are responsible for 10% of total tweets. If we limit this to original content (filtering out retweets), there are 46 users who generate the top 10% of content. Looking at what gets retweeted, there is one user - ONE! - whose retweets account for 8.5% of all retweets. Next steps for me would be locating these users on a map, and generating some heatmaps for the hashtags.

Highlighted
8 - Asteroid

I kept it pretty simple but aimed to understand the relationship between followers and total tweets. I could probably find some more value in removing outliers, but the outliers were actually what I found interesting. The accounts with the most tweets are in the lowest group of followers, and the opposite is true for the accounts with the most followers (very few tweets).

Spoiler
Highlighted
8 - Asteroid

@JefBus Your solution to challenge #89 is awesome. Great job.