Chapter 6 Appendix | Beating Vegas: Creating a Dynamic Sports Betting Model

6.1 Example Row of Dataframe used for Modeling the Game Score

Table 6.1 shows one row of the dataset used for modeling. For size purposes, I included just general columns for the key variables used in modeling, but there are individual columns for both the home and away teams that show the statistics of each team. The DVOAs listed are percentages. My mixed-linear models utilized only the listed variables, and transformations of the listed variables.

Table 6.1: Example Row of Data Used For Modeling
Date	Game ID	Home Team	Away Team	Home Score	Away Score	Week	Year	Total DVOA	Weighted DVOA	Record	Offense DVOA	Defense DVOA	S.T. DVOA	Cash Bet	Cash Percent	Ticket Number	Ticket Percent
2017-09-10	INDvLAR	LAR	IND	46	9	1	2017	-5.1	4.8	8-8	7.3	5.1	2.6	95774	0.29	478	0.21
Note:
These stats are available and specified for the home and away team.

6.2 Modeling Number of Observations

Table 6.2: Row of Dataframe used for Observation Point Model
Final Number of Obsv.	Obsv. at Time	Total Ticket Number	Total Cash	Week
188	132	1467	210373	10

**Output of Observation Number Model**

	Dependent variable:

	Number of Observations

Log(Total Ticket Number)	-9.824
	(1.105)

Log(Total Cash Bet)	-16.211
	(0.938)

Observations up to Point	0.971
	(0.012)

Constant	311.840
	(4.786)

Figure 6.1: Residual Plots for Observation Number

Figure 6.2: Residual Plots for Observation Number by Week

Figure 6.3: Residual Plots for Observation Number by Week

Table 6.2 is an example row of the dataframe used to model the number of observations that will be in a series based on the amount of cash, tickets and observations up to a time \(t\).

Table ?? displays the parameters for this model, while Figures 6.1 — 6.3 shows the diagnostic plots for this model. The residual plots show that a mixed linear model is a good approach. The residuals, while not perfect, seem to follow the normal distribution and the residuals are relatively evenly distributed for each week – the random effect.

6.3 Example Row of Test Data set with Probabilities

The test data set includes all the same columns shown in 6.1, in addition to the following key columns (and more for each of the different betting strategies). Using the K1 betting strategy for the game between the Indianapolis Colts and Jacksonville Jaguars on November 11, 2018, the spread at the first decision point was the away team, the Jaguars (+3). At this decision point spread, I expect the away team to beat the spread with a proportion of \(1 - 0.444 = 0.556\). If the Bet Team variable is equal to 1, it means the bet will be on the away team. With an expected value of betting on the Jaguars (+3) of 0.0676, the K1 betting strategy calls for me to bet on the Jaguars. My allotment for this bet is 6.76% of my current bankroll. If the future forecasted spread were to make the game more advantageous to bet on, in terms of expected value, then I would only be one-third of my allotment now. However, the forecasted future spread has the Jaguars (+2.5), which would be a much worse bet. The expected value of the Jaguars (+2.5) is not only lower than our expected value at the first decision point, but it is negative. Because the expected value would lower by betting on the game at the number my DLM forecasts for the spread, I bet my full allotment at Jaguars (+3), and there will be no future bet, regardless of where the spread actually does move. The Future Bet Team column refers to the fact that because the simulated probability is below 0.5, I would bet on the away team (the Jaguars), if I were to make a future bet.

6.4 Diagnostics for Random Effects in Mixed-Linear Models for the Score Difference

Figures 6.4 and 6.5 show the diagnostic plots for the random effects of the first mixed linear model, while Figures 6.6 — 6.8 show the diagnostic plots for the random effect of the second mixed linear model. The residuals seem relatively consistent throughout the groups, and the mixed linear models seem to fit both models appropriately.

Figure 6.4: Team-Specific Model Diagnostics by Away Team Group

Figure 6.5: Team-Specific Model Diagnostics by Away Team Group

Figure 6.6: Betting-Trend Model Diagnostics by Away Team

Figure 6.7: Betting-Trend Model Diagnostics by Away Team

Figure 6.8: Betting-Trend Model Diagnostics by Year