The Pythagorean win expectancy model developed by Bill James remains one of the most celebrated results in sports analytics. Many have extended the application of this model from its original use in baseball to other sports. Others have shown technical scoring conditions that imply the equivalence of win probability and the Pythagorean model. However, no explanation has been offered for why different sports yield different results beyond “that’s what the data say.” This article presents a theoretical analysis of the Pythagorean model by first deducing an exact within-team equation relating win percentage to seasonal scoring records, and then reconciling mathematically this result with the Pythagorean model which is cross-sectional across teams in a league. We derive a complete decomposition of the Pythagorean coefficient γ in terms of the exact model, and show that γ captures two key quantities – average points per game, and the average margins of victory and defeat – that together explain why different sports yield different results. We demonstrate this decomposition using the past decade of seasonal results from MLB baseball, NBA basketball, NFL football, and NHL hockey, and show that the data do reflect the properties deduced in our analysis.
1 Introduction
The Pythagorean win expectancy model, introduced by Bill James as a method for converting total runs for and against a baseball team to an estimate of that team’s seasonal win percentage (James 1980), is one of the most celebrated and well-studied models in sports analytics. This model is covered in many sports analytics texts (e.g. Winston 2012; Severini 2015), and has been adapted to translate seasonal scoring to win/loss records in many sports other than baseball including basketball (Kubatko et al. 2007; Winston 2012; Kubatko 2013; Statis Ticator 2015), football (Schatz 2003; Winston 2012), and hockey (Cochran and Blackstock 2009). The Pythagorean win-expectancy model has also been adapted to study overtime in various sports (Rosenfeld et al. 2010). While the majority of published research regarding this model has focused on empirical issues such as parameter estimation and goodness-of-fit (e.g. Braunstein 2010), some have pursued more theoretical inquiries. Miller (2007) was the first to show that if the probability distributions of the number of points scored by opposing teams in games follow independent Weibull distributions, the resulting probability of winning a game corresponds to the Pythagorean model for the fraction of games won (see also Miller et al. 2014), while Dayaratna and Miller (2012) and Miller et al. (2014) produced a simple yet very accurate linear approximation of the Pythagorean model via its first-order Taylor series expansion. However, no explanation has been offered for why different sports yield different Pythagorean model results beyond “that’s what the data say.” This is unsatisfying. While scoring records do not translate to win percentages the same way in different sports, and though the Pythagorean model tracks such differences across sports empirically, the model should explain why such differences result in a manner that reflects known differences between sports.
The contribution of this paper is to offer just such an explanation, and to provide the following insight: for a given sport, the single parameter of the Pythagorean model depends upon the typical scoring margins of victory and defeat in a game, in addition to the average number of points scored in total. Both scoring margins and point totals differ by sport. For example, while it is common to see a final score of 4-2 in baseball, it is extremely rare to see a score of 100-50 in basketball. The typical margins of victory and defeat in baseball are numerically close to the average number of runs scored in a game, whereas in basketball the winning margins are far smaller than the number of points scored per game. In this paper, arguing from first principles, we will show how this argument is embedded in the Pythagorean model. In particular, we will show that the ratio of the Pythagorean coefficients for two different sports is approximately equal to the ratio of average points to average winning margin for the first sport divided by the same points-to-winning-margin ratio for the second sport. The different Pythagorean coefficients for different sports thus tell a story, in that they reveal how scoring margins together with total points combine to produce winning records.
The remainder of the paper proceeds as follows: in the next section, we briefly review the mathematics of the Pythagorean model including its linearization via Taylor series. Following this, in Section 3 we derive an exact linear model from first principles for the win percentage of an individual team as a function of scoring for and against that team. This team-specific model requires no assumptions governing the probability distribution of scoring for or against, nor must we assume that total scoring by a team is independent of total scoring against a team. The only assumption required is that games cannot end in a tie. While this model is exact, it is team-specific, yet the Pythagorean model we seek to explain is of course cross-sectional across teams in a league. In Section 4, we reconcile this exact analysis with the first-order Taylor expansion of Pythagoras, which leads to a decomposition of the Pythagorean coefficient in terms of scoring margins and their relation to total points scored. We present examples of this decomposition for professional baseball, basketball, football and hockey in Section 5, while Section 6 concludes.
2 Pythagorean win expectancy model
The Pythagorean model as formulated by James (1980) related a team’s seasonal win percentage (WP) to that team’s total runs scored (RS) and runs against (RA) via
$$\mathrm{WP} = \frac{\mathrm{RS}^2}{\mathrm{RS}^2 + \mathrm{RA}^2} \tag{1}$$
Note that one can divide both RS and RA by 162 (the number of games in a baseball season) without changing the left hand side, which allows interpreting RS and RA as the average number of runs per game scored for and against a team.
While James found that this formula worked well for baseball, he did not provide a theory or mechanism that resulted in this formula; rather his result stemmed from his remarkable ability to observe empirical regularities in baseball data. Lacking a justification for squaring both RS and RA, many realized that it was a simple matter to “tune” this formula to provide a better fit to observed data. This more general form of the Pythagorean model can be written as
$$\mathrm{WP} = \frac{\mathrm{RS}^{\gamma}}{\mathrm{RS}^{\gamma} + \mathrm{RA}^{\gamma}} \tag{2}$$
which of course reduces to the original model when γ = 2. Over time, baseball analysts have concluded that 1.83 represents a better value for γ (including Bill James apparently, see Davenport and Woolner 1999, and Miller et al. 2014 among others).
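As a small illustration of equation (2), the following Python sketch evaluates the generalized Pythagorean formula for a made-up team. The function name `pythagorean_wp` and the scoring figures (5.0 runs scored and 4.0 allowed per game) are assumptions chosen for illustration, not values from the paper.

```python
def pythagorean_wp(scored: float, allowed: float, gamma: float = 2.0) -> float:
    """Generalized Pythagorean win expectancy of equation (2)."""
    if scored <= 0 or allowed <= 0:
        raise ValueError("scoring totals must be positive")
    return scored**gamma / (scored**gamma + allowed**gamma)


# Hypothetical team averaging 5.0 runs scored and 4.0 runs allowed per game.
wp_james = pythagorean_wp(5.0, 4.0)               # original exponent of 2
wp_tuned = pythagorean_wp(5.0, 4.0, gamma=1.83)   # tuned baseball exponent
wp_totals = pythagorean_wp(5.0 * 162, 4.0 * 162)  # season totals, same ratio

print(f"gamma = 2.00: {wp_james:.4f}")
print(f"gamma = 1.83: {wp_tuned:.4f}")
print(f"season totals, gamma = 2.00: {wp_totals:.4f}")
```

Because only the ratio of scoring for to scoring against matters, per-game averages and season totals give the same estimate, which is the rescaling property noted after equation (1).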
Equation (2) invites application to other sports, with the understanding that RS and RA now refer to the average number of points per game scored for and against a team in sports like basketball (Kubatko et al. 2007; Winston 2012; Kubatko 2013; Statis Ticator 2015) and football (Schatz 2003; Winston 2012), or goals per game in hockey (Cochran and Blackstock 2009). Not surprisingly, the relationship between scoring and winning is best described by different values of γ for different sports. For example, in basketball, γ ≈ 14 (Kubatko 2013), while in football, γ ≈ 2.37 (Schatz 2003). That scoring differs across baseball, basketball and football is clear to all, so it is not surprising that the resulting estimates of γ also differ. What is not clear, however, is why the γ’s differ the way that they do. For example, why should basketball’s γ be about 7.5 times higher than baseball’s? We return to this question in Section 4.
2.1 Taylor series approximation
A key tool in our approach to understanding γ is the first-order Taylor series expansion of equation (2), previously reported by Dayaratna and Miller (2012) and Miller et al. (2014). Letting Rtotal denote the average number of runs per team per game over all games in a season (which roughly equals 4.3 for baseball, Miller et al. 2014), the first-order expansion of equation (2) is given by
$$\mathrm{WP} \approx \frac{1}{2} + \frac{\gamma}{4 \times R_{\mathrm{total}}}\,(\mathrm{RS} - \mathrm{RA}) \tag{3}$$
Dayaratna and Miller (2012) show that simple linear regressions of win percentage versus the difference between runs for and against across teams (see Jones and Tappin 2005 for such regressions) result in slope estimates that are extremely close to the value of γ/(4 × Rtotal), as must be the case if the Pythagorean model accurately captures the relationship between winning and scoring. Equation (3) also implies an important method for approximating γ. Imagine running the following simple linear regression using all teams in a season
$$\mathrm{WP} = \alpha + \beta\,(\mathrm{RS} - \mathrm{RA}) + \varepsilon \tag{4}$$
and obtaining the slope estimate $\hat{\beta}$ (note that assuming all games are played, it must be that $\hat{\alpha} = 1/2$, for the total number of runs scored by all teams equals the total number of runs scored against all teams, and over all teams the average win percentage must equal 1/2). Then the Pythagorean coefficient can be estimated as
$$\hat{\gamma} = 4 \times R_{\mathrm{total}} \times \hat{\beta} \tag{5}$$
which will help unlock the puzzle of what the Pythagorean model is doing when it translates scoring to winning.
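The regression route to γ can be sketched in a few lines of Python. The league below is synthetic: nine hypothetical teams whose runs for and against are placed symmetrically around an assumed Rtotal of 4.4, with win percentages generated from equation (2) using γ = 1.83. The helper names `pythagorean_wp` and `ols_slope_intercept` are illustrative, and the example only shows that four times Rtotal times the fitted slope lands close to the γ that generated the data.

```python
def pythagorean_wp(scored, allowed, gamma):
    """Generalized Pythagorean win expectancy, equation (2)."""
    return scored**gamma / (scored**gamma + allowed**gamma)


def ols_slope_intercept(x, y):
    """Ordinary least squares slope and intercept for y = alpha + beta * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    return slope, y_bar - slope * x_bar


R_TOTAL = 4.4      # assumed league-average runs per team per game
TRUE_GAMMA = 1.83  # exponent used to generate the synthetic win percentages

# Synthetic league: league-wide runs for equal league-wide runs against.
offsets = [k / 10 for k in range(-4, 5)]
runs_for = [R_TOTAL + d for d in offsets]
runs_against = [R_TOTAL - d for d in offsets]

run_diff = [rs - ra for rs, ra in zip(runs_for, runs_against)]
win_pct = [pythagorean_wp(rs, ra, TRUE_GAMMA)
           for rs, ra in zip(runs_for, runs_against)]

beta_hat, alpha_hat = ols_slope_intercept(run_diff, win_pct)
gamma_hat = 4 * R_TOTAL * beta_hat  # slope-to-gamma conversion of equation (5)

print(f"alpha_hat = {alpha_hat:.4f}")
print(f"beta_hat  = {beta_hat:.4f}")
print(f"gamma_hat = {gamma_hat:.3f}")
```

The recovered value is not exactly 1.83 because the linear fit ignores the slight curvature of equation (2); with real data, sampling noise adds to that gap.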
3 Exact win expectancy model
The Pythagorean model and its first-order approximation apply cross-sectionally across teams in a league across regular season play. In this section we shift our attention to an exact model for the win percentage of an individual team over the course of a season. For any particular team, consider a randomly selected game, and let random variable X denote the spread, that is, the difference between runs (or more generally points) for and against that team in a game. Note that we can estimate the mean spread per game at the end of a season by
$$\widehat{E}[X] = \mathrm{RS} - \mathrm{RA} \tag{6}$$
where as before we interpret RS and RA as the average runs (points) per game for and against the team in question.
In a game selected at random, the team of interest wins the game if and only if X > 0 (the team outscores its opponents), and conversely the team loses if X < 0. Our single assumption is that ties are not possible, that is, X ≠ 0 (note that this assumption is also invoked in the Pythagorean model). Consequently, the probability that a team wins the game is given by
$$\Pr\{\text{Win}\} = \Pr\{X > 0\} \tag{7}$$
and this win probability can be estimated by the team’s seasonal win percentage, that is,
$$\widehat{\Pr}\{X > 0\} = \mathrm{WP} \tag{8}$$
Now, define a team’s expected margin of victory by
$$\mathrm{MOV} = E[X \mid X > 0] \tag{9}$$
and similarly define a team’s expected margin of defeat by
$$\mathrm{MOD} = E[-X \mid X < 0] \tag{10}$$
These definitions tell us, on average, by how much a team wins when it wins, and by how much a team loses when it loses. Each can be estimated simply from seasonal data: to estimate MOV for a given team, tally total runs scored minus runs against in games that the team of interest wins, and divide by the number of wins. To estimate MOD, tally total runs against minus runs scored in games that the team loses, and divide by the number of losses.
With these definitions, we invoke the law of total expectation to write
$$E[X] = \mathrm{MOV} \times \Pr\{X > 0\} - \mathrm{MOD} \times \left(1 - \Pr\{X > 0\}\right) \tag{11}$$
and after dividing by (MOD + MOV) and rearranging terms, we arrive at the desired result:
$$\Pr\{X > 0\} = \frac{\mathrm{MOD}}{\mathrm{MOD} + \mathrm{MOV}} + \frac{E[X]}{\mathrm{MOD} + \mathrm{MOV}} \tag{12}$$
This linear equation exactly relates the probability of a team winning to its expected point differential. Note that the derivation is completely general, and in particular does not require assuming particular probability distributions for the number of points scored for or against a team, or that such scoring be independent. The only assumption is that games do not end in a tie (that is, $\Pr\{X = 0\} = 0$).
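The exactness of this relation is easy to check numerically. The sketch below takes a short, made-up list of game spreads for one team (positive for wins, negative for losses, no ties), estimates the win percentage, MOV and MOD as described above, and compares the win percentage with the right-hand side of equation (12). Exact rational arithmetic from the standard-library `fractions` module is used so that the equality is not blurred by rounding; the function name `exact_win_model` and the spreads are assumptions for illustration.

```python
from fractions import Fraction


def exact_win_model(spreads):
    """Return (wp, mov, mod, mean_spread, predicted_wp) for one team's games."""
    if any(s == 0 for s in spreads):
        raise ValueError("ties are not allowed in the exact model")
    wins = [s for s in spreads if s > 0]
    losses = [-s for s in spreads if s < 0]
    if not wins or not losses:
        raise ValueError("need at least one win and one loss")
    n = len(spreads)
    wp = Fraction(len(wins), n)             # observed win percentage
    mov = Fraction(sum(wins), len(wins))    # average margin of victory
    mod = Fraction(sum(losses), len(losses))  # average margin of defeat
    mean_spread = Fraction(sum(spreads), n)   # average scoring differential
    predicted_wp = mod / (mod + mov) + mean_spread / (mod + mov)
    return wp, mov, mod, mean_spread, predicted_wp


# Hypothetical ten-game stretch: runs for minus runs against in each game.
spreads = [3, -1, 2, -4, 1, 5, -2, 1, -1, 2]
wp, mov, mod, mean_spread, predicted_wp = exact_win_model(spreads)

print("win percentage:", wp)
print("MOV:", mov, " MOD:", mod, " mean spread:", mean_spread)
print("right-hand side of equation (12):", predicted_wp)
```

Any other tie-free list of spreads with at least one win and one loss gives the same agreement, since the identity follows from the law of total expectation alone.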
Equation (12) is also exact for each of the n teams in the league after substituting team-specific seasonal estimates for the various parameters, that is
$$\mathrm{WP}_i = \frac{\mathrm{mod}_i}{\mathrm{mod}_i + \mathrm{mov}_i} + \frac{\mathrm{RS}_i - \mathrm{RA}_i}{\mathrm{mod}_i + \mathrm{mov}_i} \tag{13}$$
where $\mathrm{WP}_i$, $\mathrm{RS}_i$ and $\mathrm{RA}_i$ are the observed win percentage, runs scored and runs against, while $\mathrm{mov}_i$ and $\mathrm{mod}_i$ are the observed average margins of victory and defeat respectively, for the ith team, i = 1, 2, … , n. Equation (13) is illustrated in Figure 1 for the 2016 MLB season (data from http://baseball-reference.com). In the figure, there are n = 30 straight lines, each one representing a different team. The intercept $a_i$ and slope $b_i$ for the ith team are given by
$$a_i = \frac{\mathrm{mod}_i}{\mathrm{mod}_i + \mathrm{mov}_i} \tag{14}$$
and
$$b_i = \frac{1}{\mathrm{mod}_i + \mathrm{mov}_i} \tag{15}$$
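There are also 30 points, one on each line, that represent the observed win percentage and average run differential per game for each team. Note that while the intercepts differ across teams, the average of these intercepts is clearly close to 1/2. Also note that the slopes are numerically quite close, as shown by the nearly parallel lines in Figure 1, which implies that $\mathrm{mod}_i + \mathrm{mov}_i$ is roughly constant across teams.
Comparing the exact team-level equation (13) term by term with the first-order Pythagorean approximation of equation (3), it is tempting to conclude that
$$\frac{\mathrm{mod}_i}{\mathrm{mod}_i + \mathrm{mov}_i} \approx \frac{1}{2} \tag{16}$$
and
$$\frac{1}{\mathrm{mod}_i + \mathrm{mov}_i} \approx \frac{\gamma}{4 \times R_{\mathrm{total}}} \tag{17}$$
which in turn suggests that
$$\gamma \approx \frac{4 \times R_{\mathrm{total}}}{\mathrm{mod}_i + \mathrm{mov}_i} \tag{18}$$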
for i = 1, 2, … , n. However, this is not correct, for while equation (13) is exact on a team-by-team basis, equation (3) applies cross-sectionally across the teams. Indeed, as argued earlier, equation (3) can be thought of as the regression line through the 30 individual points in Figure 1; Figure 2 superimposes this regression line on Figure 1. As is clear, while the intercept of this line equals 0.5 as indeed it must, the slope is attenuated from the team-specific values. Still, equation (18) suggests that the Pythagorean parameter γ depends upon scoring margins in addition to total scoring. The question is how to move from the within-team exact model to the Pythagorean model which is cross-sectional across teams. We address this in the next section.
4 Decomposing Pythagoras
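Our approach to reconciling the exact and linearized Pythagorean models proceeds by substituting the exact equation (13) for each team on the left-hand side of equation (4), solving analytically for the estimated regression slope $\hat{\beta}$ in terms of exact model properties, and using equation (5) to arrive at the Pythagorean parameter estimate $\hat{\gamma}$. To simplify notation, let $x_i = \mathrm{RS}_i - \mathrm{RA}_i$ denote the observed average run differential per game over the course of a season for the ith team and $y_i = \mathrm{WP}_i$ the seasonal win percentage for the ith team, and recall the definitions of $a_i$ and $b_i$ from equations (14, 15).
Now, as is well known, the estimated regression slope in the model $y = \alpha + \beta x + \varepsilon$ is given by
$$\hat{\beta} = \frac{s_{xy}}{s_x^2} \tag{19}$$
where
$$s_{xy} = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y}) \tag{20}$$
is the sample covariance between run differential and win percentage, and $s_x^2 = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2$ is the sample variance of run differential. However, owing to the exact model, for any team i we have
$$y_i = a_i + b_i x_i \tag{21}$$
which, upon substitution into equation (19), yields
$$\hat{\beta} = \bar{b} + \frac{s_{xa}}{s_x^2} + \frac{s_{x^2 b}}{s_x^2} \tag{22}$$
where in deriving this result we have used the fact that $\bar{x} = 0$ owing to the equality over all teams of points scored for and against. To understand the terms on the right-hand side of equation (22), note that
$$\bar{b} = \frac{1}{n}\sum_{i=1}^{n} b_i = \frac{1}{n}\sum_{i=1}^{n}\frac{1}{\mathrm{mod}_i + \mathrm{mov}_i} \tag{23}$$
is the empirical average of the exact model slopes across all teams. The second term on the right-hand side of equation (22) follows from recognizing that the ratio $s_{xa}/s_x^2$, where $s_{xa}$ is the sample covariance between the $x_i$’s and the $a_i$’s, is exactly the estimated slope for the regression of the $a_i$’s against the $x_i$’s, and captures how the ratio $\mathrm{mod}_i/(\mathrm{mod}_i + \mathrm{mov}_i)$ changes with run (score) differential $x_i$ across teams. The third term follows from noting that
$$\frac{1}{n-1}\sum_{i=1}^{n} b_i x_i^2 - \bar{b}\, s_x^2 = \frac{1}{n-1}\sum_{i=1}^{n}(b_i - \bar{b})\,x_i^2 = s_{x^2 b} \tag{24}$$
which is the sample covariance between the $x_i^2$’s and the $b_i$’s. Substituting equation (22) into equation (5) then gives the decomposition
$$\hat{\gamma} = 4 \times R_{\mathrm{total}} \times \left(\bar{b} + \frac{s_{xa}}{s_x^2} + \frac{s_{x^2 b}}{s_x^2}\right) \tag{25}$$
Equation (25) clarifies how the Pythagorean parameter depends upon the scoring characteristics of whatever sport is in question. Other things being equal, we see that not only does $\hat{\gamma}$ increase with the average number of points scored per game (Rtotal), it also increases with $\bar{b}$, which itself declines with scoring margin. The term $s_{xa}/s_x^2$ is the rate with which the ratio of $\mathrm{mod}_i$ to $\mathrm{mod}_i + \mathrm{mov}_i$ changes with score differential $x_i$ across teams, and as can be seen from Figures 1 and 2, teams with higher average net scores (higher values of $x_i = \mathrm{RS}_i - \mathrm{RA}_i$) have lower values of $\mathrm{mod}_i/(\mathrm{mod}_i + \mathrm{mov}_i)$. This implies that $s_{xa}/s_x^2 < 0$. Simply stated, better teams have higher average net scores, larger margins of victory (so when they win they win by more), and smaller margins of defeat (so when they lose they lose by less). Finally, as we will demonstrate numerically in the next section, but as can also be inferred from Figures 1 and 2, the term $s_{x^2 b}/s_x^2$ essentially equals zero. This follows from noting that the lines in Figure 1 are essentially parallel, meaning that the slopes $b_i$ in the exact model are essentially invariant with score differential ($x_i$), and hence also invariant with $x_i^2$.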
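The decomposition can be checked numerically. The Python sketch below uses a made-up six-team league (the run differentials, margins of victory and defeat, and the Rtotal of 4.4 are all illustrative assumptions, not data from the paper) and confirms that the ordinary regression slope of win percentage on run differential equals the sum of three pieces: the average of the team slopes, the slope from regressing the team intercepts on run differential, and the covariance between the team slopes and squared run differential divided by the variance of run differential. The run differentials are chosen to sum to zero, which the identity requires.

```python
def sample_cov(u, v):
    """Sample covariance with the usual n - 1 divisor."""
    n = len(u)
    u_bar = sum(u) / n
    v_bar = sum(v) / n
    return sum((ui - u_bar) * (vi - v_bar) for ui, vi in zip(u, v)) / (n - 1)


def decompose_slope(x, mov, mod):
    """Return (beta_hat, b_bar, intercept_term, slope_term) for a league."""
    a = [d / (d + v) for v, d in zip(mov, mod)]   # team intercepts a_i
    b = [1 / (d + v) for v, d in zip(mov, mod)]   # team slopes b_i
    y = [ai + bi * xi for ai, bi, xi in zip(a, b, x)]  # exact win percentages
    s_xx = sample_cov(x, x)
    b_bar = sum(b) / len(b)
    intercept_term = sample_cov(x, a) / s_xx
    slope_term = sample_cov([xi**2 for xi in x], b) / s_xx
    beta_hat = sample_cov(x, y) / s_xx            # ordinary regression slope
    return beta_hat, b_bar, intercept_term, slope_term


# Hypothetical league: run differentials sum to zero across teams.
x = [-0.6, -0.3, -0.1, 0.1, 0.3, 0.6]
mov = [3.0, 3.2, 3.3, 3.4, 3.5, 3.7]   # average margins of victory
mod = [3.8, 3.6, 3.4, 3.3, 3.2, 3.0]   # average margins of defeat
R_TOTAL = 4.4                          # assumed average runs per team per game

beta_hat, b_bar, intercept_term, slope_term = decompose_slope(x, mov, mod)
gamma_hat = 4 * R_TOTAL * beta_hat

print(f"beta_hat        = {beta_hat:.5f}")
print(f"sum of terms    = {b_bar + intercept_term + slope_term:.5f}")
print(f"b_bar           = {b_bar:.5f}")
print(f"intercept term  = {intercept_term:.5f}")
print(f"slope term      = {slope_term:.5f}")
print(f"gamma_hat       = {gamma_hat:.3f}")
```

In this toy league better teams have smaller margins of defeat relative to their margins of victory, so the intercept term comes out negative, in line with the sign argument in the text.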
These points are illustrated graphically in Figure 3 for the 2016 MLB season. There are three gray lines in Figure 3, each denoting a different contribution to the (first-order) Pythagorean win percentage. The horizontal line α = 0.5 sets the average win percentage. The increasing line shows how win percentage increases with average run differential at a slope that depends upon margins of victory and defeat; this is essentially the average of the 30 lines in Figures 1 and 2 with the intercept of 1/2 subtracted. The decreasing line shows how win percentage declines on account of the declining ratio $\mathrm{mod}_i/(\mathrm{mod}_i + \mathrm{mov}_i)$ with run differential. This line captures the effect of hopping from higher to lower lines in Figures 1 and 2 as run differential increases. As discussed above, we ignore $s_{x^2 b}/s_x^2 \approx 0$. Adding the three gray lines together yields the black line that represents the (first-order) Pythagorean model, and as was already shown, this line provides an excellent fit to the observed win percentages, shown as black dots in Figure 3.
5 Application to baseball, basketball, football and hockey
Having worked through the theory of decomposing Pythagoras, it is a simple matter to apply this decomposition to different seasons in different sports. Our purpose in doing so is both to apply our decomposition results to different sports and to see what the Pythagorean coefficients tell us about scoring in different sports. We will discover empirically that the ratio of the Pythagorean coefficients across two sports approximately equals the ratio of $\bar{b} \times R_{\mathrm{total}}$ across these same sports. As $\bar{b} \times R_{\mathrm{total}}$ depends upon both average points scored and margins of victory and defeat, we have a way of understanding why some sports have larger Pythagorean coefficients compared to others.
Table 1 reports results for the past ten MLB seasons (data from http://baseball-reference.com). There are several points worth noting in these results. First, all components of the decomposition are quite stable: plotting $\hat{\gamma}$, $\bar{b}$, $s_{xa}/s_x^2$ and $s_{x^2 b}/s_x^2$ over time would yield four nearly flat lines. Second, as argued in the last section, the rate $s_{xa}/s_x^2$ with which $\mathrm{mod}_i/(\mathrm{mod}_i + \mathrm{mov}_i)$ changes with net scoring $x_i$ is negative. Third, the term $s_{x^2 b}/s_x^2$ is very small in absolute value compared to $\bar{b}$ (at most 5%), and is often within two standard errors of zero. Fourth, in absolute value, $s_{xa}/s_x^2$ is about 30% of $\bar{b}$, which suggests a further approximation for MLB:
$$\hat{\gamma} \approx 4 \times R_{\mathrm{total}} \times (\bar{b} - 0.3\,\bar{b}) = 2.8 \times R_{\mathrm{total}} \times \bar{b} \tag{26}$$
The importance of this approximation is that it reveals the dependence of the estimated Pythagorean parameter on two key sports quantities: the average number of points per game (Rtotal) and the winning margin as expressed via $\bar{b}$ (which is the average of the reciprocals of $\mathrm{mod}_i + \mathrm{mov}_i$).
Table 2 reports the results of applying our decomposition to NBA seasons since 2007 (data from http://basketball-reference.com; we have omitted the lockout-shortened 2012 season when only 66 instead of 82 games were played). Similar to the baseball results from Table 1, we see stability in all elements of the decomposition over time, $s_{xa}/s_x^2 < 0$ for all years, and $s_{x^2 b}/s_x^2$ is again very small in absolute value compared to $\bar{b}$ and often statistically not different from zero. But perhaps most interesting, we see that in absolute value, $s_{xa}/s_x^2$ is again about 30% of $\bar{b}$ (the average ratio is 28.5%), which means that equation (26) is also quite accurate for basketball. This suggests something quite fundamental about how scoring translates to winning in basketball versus baseball, for via equation (26), the ratio of the Pythagorean parameter for basketball to baseball should approximately equal the ratio of $\bar{b} \times R_{\mathrm{total}}$ for basketball to baseball. While Rtotal is the average number of points scored per game, $\bar{b}$ can heuristically be thought of as the reciprocal of the average winning margin (it is really the average of the reciprocals rather than the reciprocal of the average). This means that the Pythagorean gammas should roughly be in proportion to the ratio of scoring to scoring margin for baseball and basketball. From Table 1, we see that for baseball the average values for $\hat{\gamma}$, $\bar{b}$ and Rtotal equal 1.77, 0.1486, and 4.40 respectively. From Table 2, the same quantities for basketball average 13.11, 0.0475, and 100.06. The ratio of the average $\hat{\gamma}$’s for basketball to baseball thus equals 13.11/1.77 = 7.41. The ratio of the product of the average $\bar{b}$ and Rtotal for basketball to baseball equals (0.0475 × 100.06)/(0.1486 × 4.40) = 7.27, which is very close. What the Pythagorean model seems to be saying is that the way scoring translates to winning depends upon two key quantities: the average number of points per game, and the average winning margin (more precisely, the sum of the average margins of victory and defeat).
Table 3 presents results for NFL football seasons since 2007 (data from http://pro-football-reference.com). The same observations again hold: stability in all components of the decomposition over time, $s_{xa}/s_x^2 < 0$ for all years, and truly insignificant values of $s_{x^2 b}/s_x^2$ in all years. However, the absolute ratio of $s_{xa}/s_x^2$ to $\bar{b}$ equals about 37% for football, which is larger than the 30% found for baseball and basketball. Still, it is inviting to see how the Pythagorean coefficient ratios for football and other sports compare to the ratios of average $\bar{b}$ × average Rtotal. For football, the average values for $\hat{\gamma}$, $\bar{b}$ and Rtotal equal 2.51, 0.0464 and 22.37 respectively. Comparing football to baseball, we see that the ratio of the average $\hat{\gamma}$’s equals 2.51/1.77 = 1.42, while the ratio of the product of the average $\bar{b}$ and Rtotal for football to baseball equals (0.0464 × 22.37)/(0.1486 × 4.4) = 1.59, which is close to the ratio of the Pythagorean coefficients. Comparing basketball to football, the ratio of the average $\hat{\gamma}$’s equals 13.11/2.51 = 5.22, while the ratio of the average $\bar{b}$ × average Rtotal for basketball to football is given by (0.0475 × 100.06)/(0.0464 × 22.37) = 4.58, which is less close but still in the ballpark (or the court).
Finally, Table 4 presents results for NHL hockey seasons since 2007 (data from https://www.hockey-reference.com/). Again one sees stability in all components of the decomposition over time, $s_{xa}/s_x^2 < 0$ for all years, and insignificant values of $s_{x^2 b}/s_x^2$ in all years. The absolute ratio of $s_{xa}/s_x^2$ to $\bar{b}$ averages about 26% for hockey, which is closer to the 30% ratio found for baseball and basketball than the 37% ratio for football. Taking the ratio of the average Pythagorean coefficient for hockey (2.03) to the same for baseball, basketball and football respectively yields 1.15, 0.15 and 0.81. Now taking the ratio of average $\bar{b}$ × average Rtotal for hockey (0.69) to the same for baseball, basketball and football respectively yields 1.06, 0.15 and 0.67. These calculations suggest that while the Pythagorean “story” relating win percentage to both scoring totals and margins of victory and defeat works well when comparing baseball, basketball and hockey, the results are less satisfying for football.
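The cross-sport comparisons in this section are simple ratio arithmetic, and the short Python sketch below redoes them from the rounded averages quoted in the text. The dictionary layout and the helper `ratio_pair` are illustrative assumptions. For hockey only the product of the average slope and Rtotal (0.69) is quoted, so it is entered directly; because the inputs are rounded, recomputed ratios can differ from the published ones in the last digit.

```python
# Averages quoted in the text: Pythagorean coefficient, and the product of the
# average exact-model slope with average points per team per game (Rtotal).
SPORTS = {
    "baseball":   {"gamma": 1.77,  "slope_times_points": 0.1486 * 4.40},
    "basketball": {"gamma": 13.11, "slope_times_points": 0.0475 * 100.06},
    "football":   {"gamma": 2.51,  "slope_times_points": 0.0464 * 22.37},
    "hockey":     {"gamma": 2.03,  "slope_times_points": 0.69},
}


def ratio_pair(numerator, denominator):
    """Return (gamma ratio, slope-times-points ratio) for two sports."""
    top, bottom = SPORTS[numerator], SPORTS[denominator]
    gamma_ratio = top["gamma"] / bottom["gamma"]
    product_ratio = top["slope_times_points"] / bottom["slope_times_points"]
    return gamma_ratio, product_ratio


comparisons = [
    ("basketball", "baseball"),
    ("football", "baseball"),
    ("basketball", "football"),
    ("hockey", "baseball"),
    ("hockey", "basketball"),
    ("hockey", "football"),
]

results = {pair: ratio_pair(*pair) for pair in comparisons}
for (num, den), (gamma_ratio, product_ratio) in results.items():
    print(f"{num:>10} / {den:<10} gamma ratio = {gamma_ratio:5.2f}   "
          f"slope x points ratio = {product_ratio:5.2f}")
```

When the two columns are close, the difference in Pythagorean coefficients between the two sports is largely accounted for by the ratio of scoring to scoring margin.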
Figure 4 plots the estimated Pythagorean γ’s for baseball, football, hockey (left vertical axis) and basketball (right vertical axis) over time. While there is some year-to-year variation, by sport the magnitudes of these coefficients are quite stable over time. As explained by our decomposition, the Pythagorean model does capture the differences between scoring (and scoring margins) in different sports.
6 Conclusions
The Pythagorean win expectancy model remains one of the most celebrated tools in sports analytics, and while many have documented its ability to approximate win percentages from seasonal scoring records, very little has been written in sports-specific terms regarding what this model really does, and why it produces different results for different sports. Here we have offered an original explanation: the single coefficient in the Pythagorean model effectively captures the ratio of average scoring to the sum of the margins of victory and defeat. These characteristics clearly differ by sport, and the Pythagorean coefficient estimates for different sports capture this difference. We discovered this result from first principles – we first derived an exact within-team model relating win percentage to scoring differential, and mathematically reconciled this model with the Pythagorean model which is cross-sectional across teams – and showed that at least for baseball, basketball, football and hockey, observed data give rise to Pythagorean coefficient estimates that agree with our analytical claims.
It is remarkable that Bill James deduced the Pythagorean model for baseball based solely on observing empirical patterns in the data. James is well known for having intuited many things about sports more generally. In deference to his insights, we close with the following anecdote reported by Weinbaum (2013): when Daryl Morey applied the Pythagorean model to basketball, he was working part-time for STATS Inc., which was co-founded by Bill James. Upon learning of Morey’s result, James remarked (Weinbaum 2013): “I would never have guessed that you could adapt the Pythagorean to basketball. Basketball has very small margins, relative to the score. A top baseball team scores 25 percent more runs than it allows, but a top basketball team outscores its opponents by only 6 to 7 percent.” Viewing his statements in light of the results in this paper, it appears that James understood the sports fundamentals of his Pythagorean model more than even he realized.
References
Dayaratna, K. and S. J. Miller. 2012. “First Order Approximations of the Pythagorean Won-Loss Formula for Predicting MLB Teams Winning Percentages.” By the Numbers – The Newsletter of the SABR Statistical Analysis Committee 22:15–19.
Davenport, C. and K. Woolner. 1999. “Revisiting the Pythagorean Theorem.” Baseball Prospectus. http://www.baseballprospectus.com/article.php?articleid=342. Accessed on June 11, 2017.
James, B. 1980. 1980 Baseball Abstract. Lawrence, KS: Self-published.
Jones, M. and L. Tappin. 2005. “The Pythagorean Theorem of Baseball and Alternative Models.” The UMAP Journal 26(1):23–34.
Kubatko, J. 2013. “Pythagoras of the Hardwood.” Statitudes. http://statitudes.com/?s=Pythagoras+of+the+Hardwood. Accessed on June 10, 2017.
Kubatko, J., D. Oliver, K. Pelton and D. T. Rosenbaum. 2007. “A Starting Point for Analyzing Basketball Statistics.” Journal of Quantitative Analysis in Sports 3(3):1–24. doi:10.2202/1559-0410.1070.
Miller, S. J., T. Corcoran, J. Gossels, V. Luo and J. Porfilio. 2014. “Pythagoras at the Bat.” Pp. 89–113 in Social Networks and the Economics of Sports, edited by P. M. Pardalos and V. Zamaraev. Cham, Switzerland: Springer International. doi:10.1007/978-3-319-08440-4_6.
Rosenfeld, J. W., J. I. Fisher, D. Adler and C. Morris. 2010. “Predicting Overtime with the Pythagorean Formula.” Journal of Quantitative Analysis in Sports 6(2):1–19. doi:10.2202/1559-0410.1244.
Schatz, A. 2003. “Pythagoras on the Gridiron.” Football Outsiders. http://www.footballoutsiders.com/stat-analysis/2003/pythagoras-gridiron. Accessed on June 10, 2017.
Severini, T. A. 2015. Analytic Methods in Sports. Boca Raton, FL: CRC Press.
Statis Ticator. 2015. “Morey’s Law: How Do Points Scored and Points Allowed Tie to Win Percentage?” Statisticator. Accessed on June 10, 2017.
Weinbaum, W. 2013. “Moreyball.” Northwestern. http://www.northwestern.edu/magazine/winter2013/feature/moreyball.html. Accessed on June 13, 2017.
©2017 Walter de Gruyter GmbH, Berlin/Boston