Рет қаралды 960
This is the eleventh video in a series where we will attempt to predict the winning probabilities for (MLB) Major League Baseball games using modern machine learning techniques. In this video, we shift our focus to predicting the number of runs scored. This requires us to wrangle our data into a new format where we have one teams hitting features, the opposite teams pitching features and try to predict the number of runs scored. We want to predict the distribution of runs scored, not just a point estimate, so that we can use these models to get probabilities for the over/under. As such we will employ a probabilistic regression technique called Coarsage (available in the StructureBoost package) to model these predicted conditional distributions.
Data: www.retrosheet.org, www.oddsshark.com
Notebooks: github.com/num...
github.com/num...
Personal links:
Consulting: www.numeristical.com
Github: github.com/num...