Abstract
Every shot in basketball has an opportunity cost; one player’s shot eliminates all potential opportunities from their teammates for that play. For this reason, player-shot efficiency should ultimately be considered relative to the lineup. This aspect of efficiency—the optimal way to allocate shots within a lineup—is the focus of our paper. Allocative efficiency should be considered in a spatial context since the distribution of shot attempts within a lineup is highly dependent on court location. We propose a new metric for spatial allocative efficiency by comparing a player’s field goal percentage (FG%) to their field goal attempt (FGA) rate in context of both their four teammates on the court and the spatial distribution of their shots. Leveraging publicly available data provided by the National Basketball Association (NBA), we estimate player FG% at every location in the offensive half court using a Bayesian hierarchical model. Then, by ordering a lineup’s estimated FG%s and pairing these rankings with the lineup’s empirical FGA rate rankings, we detect areas where the lineup exhibits inefficient shot allocation. Lastly, we analyze the impact that sub-optimal shot allocation has on a team’s overall offensive potential, demonstrating that inefficient shot allocation correlates with reduced scoring.
- a
The first and second authors contributed equally to this work.
Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission. This paper represents partial fulfillment of the first author’s PhD thesis requirements.
Research funding: None declared.
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.
A.1 Empirical implementation
To illustrate some important considerations associated with our approach, we present an example of LPL and PLC using empirical FG% and FGA rates. This example demonstrates that our metrics are agnostic to the underlying FG% model.
We examine the Cleveland Cavaliers’ starting lineup as discussed in the main text. In order to obtain FG% and FGA rate estimates, we divide the court into 12 discrete regions and calculate the empirical FG%s and FGA rates for each player within these regions. We defined these regions based on our understanding of the court, but it is worth noting that defining these regions requires many of the same considerations as with any histogram style estimator; namely, that increasing the number of regions will decrease bias at the expense of increasing variance. In some cases, a player may have only one or two shots within an area, resulting in either unrealistically high or low FG% estimates. As an ad hoc solution to this, we give all players one made field goal and five field goal attempts within each region, which means that players with just a handful of shots in a region will have their associated FG% anchored near 20%. For the field goal attempt estimates, we simply count up the number of attempts for each player within each section, and normalize them to get the attempts per 36 min. With these FG% and FGA rate estimates, we can replicate the analysis detailed in Section 3.
Figure 14 shows the empirical ranks for this lineup, as well as the rank correspondence. Generally, it shows the same patterns as the model-based analysis in Figures 5 and 6. However, there are some key differences, including Tristan Thompson having a higher field goal percentage rank from the right midrange and a corresponding reduction in rank for Kevin Love in the same area. This pattern is also manifest in Figure 15, which shows the empirical LPL. We observe that most lineup points appear to be lost in the right midrange and in above the break three point shots. Finally, considering the empirical PLC in Figure 15, we notice that in addition to the Love-Thompson tradeoff in the midrange, JR Smith appears to be overshooting from the perimeter, while Kyrie Irving and LeBron James both exhibit undershooting.

Histogram of

Left: 20% quantile LPL surfaces for the Cleveland Cavaliers starting lineup. Middle: median LPL surfaces. Bottom: 80% quantile LPL surfaces. The top rows show LPL per 36 min while the bottom rows show LPLShot.

Posterior distribution of the effect for TGLPL in model (17)–(18) described in Section 4.2.
The persistence of the Love-Thompson connection in the midrange in this empirical analysis, and its divergence from what we saw in the model based analysis, merits a brief discussion. Kevin Love and Tristan Thompson both had a low number of shots from the far-right midrange region, with Love shooting 8 for 26 and Thompson shooting 4 for 6. Because they both shot such a low amount of shots, even with the penalty of one make and four misses added to each region, Thompson appears far better. This highlights the fact that although LPL and PLC are model agnostic, the underlying estimates for FG% do matter and raw empirical estimates alone may be too noisy to be useful in calculating LPL. One simple solution may be to use a threshold and only consider players in a region if the number of their field goal attempts passes that threshold.
A.2 Additional figures
References
Banerjee, S., B. P. Carlin, and A. E. Gelfand. 2015. Hierarchical Modeling and Analysis for Spatial Data, 2nd ed. Boca Raton, FL: CRC Press.10.1201/b17115Search in Google Scholar
Besag, J. 1974. “Spatial Interaction and the Statistical Analysis of Lattice Systems.” Journal of the Royal Statistical Society: Series B 36 (2): 192–236, https://doi.org/10.1111/j.2517-6161.1974.tb00999.x.Search in Google Scholar
Carpenter, B., A. Gelman, M. D. Hoffman, D. Lee, B. Goodrich, M. Betancourt, B. Marcus, J. Guo, P. Li, and R. Allen. 2017. “Stan: A Probabilistic Programming Language.” Journal of Statistical Software 76 (1), https://doi.org/10.18637/jss.v076.i01.Search in Google Scholar
Cervone, D., A. D’Amour, L. Bornn, and G. Kirk. 2016. “A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes.” Journal of the American Statistical Association 111 (514): 585–599, https://doi.org/10.1080/01621459.2016.1141685.Search in Google Scholar
Chang, Y. H., R. Maheswaran, J. Su, S. Kwok, T. Levy, W. Adam, and K. Squire. 2014. “Quantifying Shot Quality in the NBA.” In The 8th Annual MIT Sloan Sports Analytics Conference. Boston, MA.Search in Google Scholar
D’Amour, A., D. Cervone, L. Bornn, and G. Kirk. 2015. “Move or Die: How Ball Movement Creates Open Shots in the nba.” In Sloan Sports Analytics Conference.Search in Google Scholar
Diggle, P. 1985. “A Kernel Method for Smoothing Point Process Data.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 34 (2): 138–147, https://doi.org/10.2307/2347366.Search in Google Scholar
Dixon, M. J., and S. G. Coles. 1997. “Modelling Association Football Scores and Inefficiencies in the Football Betting Market.” Journal of the Royal Statistical Society: Series C (Applied Statistics) 46 (2): 265–280, https://doi.org/10.1111/1467-9876.00065.Search in Google Scholar
Gelman, A., J. B. Carlin, H. S. Stern, D. B. Dunson, A. Vehtari, and D. B. Rubin. 2013. Bayesian Data Analysis, 3rd ed.: Chapman & Hall/CRC Texts in Statistical Science. Taylor & Francis.10.1201/b16018Search in Google Scholar
Goldman, M., and J. M. Rao. 2011. “Allocative and Dynamic Efficiency in NBA Decision Making.” In The 5th Annual MIT Sloan Sports Analytics Conference. Boston, MA.Search in Google Scholar
Hollinger, J. 2005. “Pro Basketball Forecast, 2005-06.” Potomac Books.Search in Google Scholar
Kubatko, J., D. Oliver, K. Pelton, and D. T. Rosenbaum. 2007. “A Starting Point for Analyzing Basketball Statistics.” Journal of Quantitative Analysis in Sports 3 (3), https://doi.org/10.2202/1559-0410.1070.Search in Google Scholar
Lee, D. D., and H. S. Seung. oct 1999. “Learning the Parts of Objects by Non-negative Matrix Factorization.” Nature 401: 788, https://doi.org/10.1038/44565.Search in Google Scholar
Lindgren, F., and H. Rue. 2015. “Bayesian Spatial Modelling with R-INLA.” Journal of Statistical Software 63 (19): 1–25. https://doi.org/10.18637/jss.v063.i19.Search in Google Scholar
Lindgren, F., H. Rue, and J. Lindström. 2011. “An Explicit Link between Gaussian Fields and Gaussian Markov Random Fields: The Stochastic Partial Differential Equation Approach.” Journal of the Royal Statistical Society: Series B 73 (4): 423–498, https://doi.org/10.1111/j.1467-9868.2011.00777.x.Search in Google Scholar
Miller, A., L. Bomn, R. Adams, and G. Kirk. 2014. “Factorized Point Process Intensities: A Spatial Analysis of Professional Basketball.” In 31st International Conference on Machine Learning, ICML 2014, Vol. 1, 398–414.Search in Google Scholar
Oliver, D. 2004. Basketball on Paper: Rules and Tools for Performance Analysis: Brassey’s, Incorporated.Search in Google Scholar
Sandholtz, N., and L. Bornn. 2020. Markov Decision Processes with Dynamic Transition Probabilities: An Analysis of Shooting Strategies in Basketball. arXiv preprint arXiv:1812.05170.10.1214/20-AOAS1348Search in Google Scholar
Simpson, D., J. B. Illian, F. Lindgren, S. H. Sørbye, and H. Rue. 2016. “Going off Grid: Computationally Efficient Inference for Log-Gaussian Cox Processes.” Biometrika 103 (1): 49–70. https://doi.org/10.1093/biomet/asv064.Search in Google Scholar
© 2020 Walter de Gruyter GmbH, Berlin/Boston