community
cancel
Showing results for 
Search instead for 
Did you mean: 
Do you have the skills to make it to the top? Subscribe to our weekly challenges. Try your best to solve the problem, share your solution, and see how others tackled the same problem. We share our answer too.
Weekly Challenge
Do you have the skills to make it to the top? Subscribe to our weekly challenges. Try your best to solve the problem, share your solution, and see how others tackled the same problem. We share our answer too.
Unable to display your progress at this time. Please try again a little later, or contact an administrator if you continue to see this error.
Getting started with Designer? | Start your journey with our new Learning Path!

Challenge #103: Just another game?

Asteroid

Used the Association tool to check the correlation. Most of the variances have p-value not low enough to be significant. Below is what I got...

Challenge 103 SuperBowl Prediction.PNG

How to decide if Super Bowl is (or isn't) a game? Predictions for Super Bowl are just similar as those for week 13

Alteryx Certified Partner

My solution :)

Spoiler
challenge_103.PNG
Asteroid

Hi! Here my challenge :)

 

Spoiler
challenge_103.PNG
Bolide

I didn't quite meet the predictions of your model, but I came to a similar end result. It is not quite "just another game" in comparison to the week I chose. Anyway...Go Buccaneers!!!! 

buccaneers.png

Spoiler
Challenge 103.PNG
Pulsar
Pulsar
Spoiler
Capture.PNG
Alteryx Certified Partner

My solution. I decided to use only three variables considering the p-values.

Spoiler
21-06-_2019_23-09-36.png
Alteryx Certified Partner
Spoiler
103. Predictive.PNG103. Predictive 2.PNG103. Predictive 3.PNG


Could have parsed it in a cleaner way but it did the job! Then I learned about the association analysis tool from @Natasha's workflow and how to then use those chosen variables for regression analysis. Predicted scores were done using only one variable in the end, just to see how close you could get with one variable (Offense - PassY)
Asteroid

Looks like defensive stats might be better at predicting Super Bowl scores than regular season scores. Hopefully this reinforces the idea that a strong defense is a strong offense, and shows how that giving an inch can indeed result in losing a mile. 

Spoiler
ZH WF.PNG
Fireball

I've been sitting on this one for a looong time, because I got sidetracked trying to figure out how to use the R tool to generate residual plots (we were on an older version at the time, that didn't appear to include them). I finally figured out how to do it, even though they're part of the standard linear regression tool output now! I took the Intro to Advanced Analytics training at Inspire 2019, and we asked about residual plots. The instructor advised us to calculate the residuals and create the plots ourselves, so I'm guessing this is a pretty recent addition.

 

Workflow:

Spoiler
Parse and prep data:

challenge_103_01_parse and prep.png


Hold out the super bowl weeks, plus one random week from each year:
(I need to find a better way to randomly sample within groups - this works, but isn't reproducible)
challenge_103_02_ sample games.PNG


Data investigation:
challenge_103_03_investigation.PNG


Build the model and score the sample weeks:
(I used stepwise regression to select the model)

In practice, I'd do a little more investigation, as I'm not satisfied with this model. I'd also want to educate myself more on the "business context," since I know very little about football! 
challenge_103_04_prediction.PNG

Results:

Spoiler
Selected model:
challenge_103_04b_model_output.PNG


Predictions:
challenge_103_05_output.PNG 

 

Alteryx Certified Partner

 

Spoiler
103.pngwith many correlated features I should have considered extracting principal components, but I can interpret that because I don't know the sport. I manually removed fields from the equation (which doesn't even bother with interactions) starting with the field with the highest P value (thus lowest absolute t-value). I reran the workflow until I noticed that I had significance values of all below .1 so I stuck with that.