Modeling the NFL Betting Market

Baltimore Ravens

Looking ahead to the 2025/26 NFL season, I analyzed DraftKings betting lines to understand how point spreads and over/under totals reflect expectations about team performance. Using data from each week’s published lines, I estimated statistical models to recover implied team strength and scoring profiles directly from the market itself.

To create the dataset for this analysis, I compiled weekly NFL betting lines for the 2025–26 season. For each game, I extracted the point spread, over/under total, and team identities from the JSON data behind the sportsbook’s odds tables. I processed this data in R, retaining one record per team per game and pairing home and away teams to calculate spreads and totals. By standardizing team names and focusing on consistent sources (DraftKings lines), I assembled a clean, structured dataset that allowed me to estimate team ratings and scoring tendencies directly from the market’s expectations.


Modeling Point Spreads

We began by modeling the point spread between two teams as:

Point Spread = Rhome Rroad + HFA + ε

Where

  • Point Spead is the number of points the home team is favored by (negative if home team is underdog),
  • Rhome and Rroad are latent ratings for each team,
  • HFA: home-field advantage (estimated to be around 1.5 points),
  • ε is an error term

By regressing observed spreads on team indicators (while imposing a constraint that the average team rating is 0), we recovered each team’s market-implied rating. These ratings can be interpreted as the number of points a team is expected to be favored over an average opponent on a neutral field. In any game, the point spread in favor of the home team is equal to the home team’s rating, minus the road team’s rating, plus 1.5 points. The model offers a near-perfect explanation for point spreads (R2 = .98).


Modeling Over/Under Totals

Next, we modeled the over/under line (expected total points scored in a game) using:

OUij = Ti + Tj + μ + ε

Where

  • OUij is the over/under line in a game with teams i and j,
  • Ti and Tj are team-specific effects on the expected point total,
  • μ is the league-wide average expected points (estimated to be 45.7)
  • ε: residual error

This symmetric model assumes each team contributes additively to the total expected points — without needing a home-field term — and fit the market lines remarkably well (R² > 0.99).


Visualizing Team-Level Effects

We visualized these team effects using a one-dimensional plot with team logos scaled and positioned according to their estimated ratings or scoring impact. These visuals make it easy to grasp, at a glance, which teams the market considers strongest and which tend to produce high- or low-scoring games.

To illustrate, consider a game between the Detroit Lions (R = 3.7, T = 2.2) and Chicago Bears (R = 0.2, T = 0.5). If Detroit is the home team, the Lions will be favored by 3.7 – 0.2 + 1.5 = 5 points. If Chicago is the home team, the Bears will be 0.2 – 3.7 + 1.5 = -2 point underdogs. At either stadium, the over/under line is 2.2 + 0.5 + 45.7 ≈ 48.5 points. In this manner, one may obtain point spreads and over/under lines for the entire NFL season.

The most striking feature of the betting line models uncovered in this analysis is their simplicity. The point spread and over/under values, months before the season begins, are simply additive models based on the two teams involved. None of the factors that commentators and fans talk about — winter weather, rest after bye weeks, night v day games, travel distances, division rivalries, etc. — play any role in the betting lines.

As the season unfolds and the markets obtain more information about teams, I would assume that the basic models remain the same with some adjustment of team-specific factors. For example, if Detroit loses a key player from its offense, its R should be adjusted downward, and possible its T as it may be expected to score fewer points.


Parity and Prediction

A key insight involves translating point spreads into win probabilities. If a team is favored to win by 5 points, what is the probability that they will win? The win probability is a function of the point spread, taking into account the underlying variability of NFL margins of victory.

P ( Win ) = Φ ( Point Spread σ )

Where

  • P(Win) is the probability of the home team winning,
  • Point Spead is the number of points the home team is favored by (negative if home team is underdog),
  • σ is the observed standard deviation of margins of victory, ≈13.45,
  • Φ is the cumulative density function for the normal distribution.

In practice, the relationship between spread and win percentage is nearly linear for point spreads between –14 and +14. Referring to the figure, one can see that few NFL games have extreme point spreads. When the Jets visit the Bills in Week 18, the Bills are favored to win by 5.2 – (-4.7) + 1.5 ≈ 11.5 points, an extreme point spread by NFL standards. A widely used approximation that follows the formula above is:

Win % 50 + 3 × Spread

This implies that a team favored by 1 point has roughly a 53% chance to win, a heuristic that aligns closely with historical outcomes. When the Bills are favored by 11.5 points, possibly the most extreme case, their win probability is 84.5%. In a more typical situation, Bears -2 point underdogs, there is a 44% win probability. Team R and T values represent tendencies in a game characterized by random forces.

It is also interesting to note that there is more variability in R values than T values and some weak positive correlation between the two values. The teams expected to have “high scoring games” have T values around 2 and the team expected to have the lowest scoring games, Cleveland, has T = -3. You can see some difference between teams expected to succeed from strong defense (e.g. Philadelphia and Kansas City) and those expected to succeed with more scoring (e.g. Baltimore and Buffalo), but the expected difference is less than a field goal.


Further Reading

  • Stern, H. (1991). On the Probability of Winning a Football Game. Journal of the American Statistical Association, 86(415), 816–823. JSTOR Link
  • Levitt, S. D. (2004). Why Are Gambling Markets Organised So Differently from Financial Markets? The Economic Journal, 114(495), 223–246. Wiley Link
  • Harville, D. A. (1980). Predictions for National Football League Games via Linear-Model Methodology. Journal of the American Statistical Association, 75(371), 516–524.

By reverse-engineering betting lines, we can see how sportsbooks distill expectations into numbers — and how much information is already priced into the market. These results suggest that a large portion of what oddsmakers do can be uncovered with simple, elegant statistical tools.

Leave a Reply

Your email address will not be published. Required fields are marked *