Abstract
A rating system provides relative measures of superiority between adversaries. We propose a novel and simple approach, which we call pi-rating, for dynamically rating Association Football teams solely on the basis of the relative discrepancies in scores through relevant match instances. The pi-rating system is applicable to any other sport where the score is considered as a good indicator for prediction purposes, as well as determining the relative performances between adversaries. In an attempt to examine how well the ratings capture a team’s performance, we have a) assessed them against two recently proposed football ELO rating variants and b) used them as the basis of a football betting strategy against published market odds. The results show that the pi-ratings outperform considerably the widely accepted ELO ratings and, perhaps more importantly, demonstrate profitability over a period of five English Premier League seasons (2007/2008–2011/2012), even allowing for the bookmakers’ built-in profit margin. This is the first academic study to demonstrate profitability against market odds using such a relatively simple technique, and the resulting pi-ratings can be incorporated as parameters into other more sophisticated models in an attempt to further enhance forecasting capability.
We acknowledge the financial support by the Engineering and Physical Sciences Research Council (EPSRC) for funding this research, and the reviewers and Editors of this Journal whose comments have led to significant improvements in the paper.
Appendix
Appendix A Rating development over a period of 20 seasons.

Rating development over a period of 20 seasons, assuming λ=0.035 and γ=0.7, for the six most popular EPL teams (from season 1992/1993 to season 2011/2012 inclusive).
Appendix B Learning rates λ and γ
Squared error values generated based on learning rates λ and γ.
γ | λ | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0.05 | 0.10 | 0.15 | 0.20 | 0.25 | 0.30 | 0.35 | 0.40 | 0.45 | 0.50 | 0.55 | 0.60 | 0.65 | 0.70 | 0.75 | 0.80 | 0.85 | 0.90 | 0.95 | 1.00 | |
0.005 | 2.8436 | 2.8365 | 2.8301 | 2.8243 | 2.8192 | 2.8148 | 2.8111 | 2.8082 | 2.8061 | 2.8050 | 2.8049 | 2.8059 | 2.8082 | 2.8119 | 2.8173 | 2.8246 | 2.8340 | 2.8461 | 2.8616 | 2.8809 |
0.010 | 2.7273 | 2.7212 | 2.7156 | 2.7105 | 2.7057 | 2.7015 | 2.6978 | 2.6946 | 2.6921 | 2.6902 | 2.6891 | 2.6890 | 2.6902 | 2.6930 | 2.6981 | 2.7064 | 2.7193 | 2.7389 | 2.7680 | 2.8115 |
0.015 | 2.6847 | 2.6800 | 2.6756 | 2.6716 | 2.6680 | 2.6647 | 2.6617 | 2.6591 | 2.6568 | 2.6549 | 2.6535 | 2.6527 | 2.6527 | 2.6538 | 2.6567 | 2.6624 | 2.6730 | 2.6918 | 2.7261 | 2.7896 |
0.020 | 2.6658 | 2.6618 | 2.6582 | 2.6548 | 2.6517 | 2.6489 | 2.6463 | 2.6439 | 2.6418 | 2.6399 | 2.6384 | 2.6372 | 2.6366 | 2.6368 | 2.6381 | 2.6417 | 2.6494 | 2.6659 | 2.7012 | 2.7800 |
0.025 | 2.6561 | 2.6526 | 2.6493 | 2.6462 | 2.6434 | 2.6408 | 2.6384 | 2.6362 | 2.6342 | 2.6325 | 2.6310 | 2.6297 | 2.6289 | 2.6286 | 2.6293 | 2.6315 | 2.6371 | 2.6506 | 2.6842 | 2.7760 |
0.030 | 2.6518 | 2.6485 | 2.6453 | 2.6424 | 2.6397 | 2.6372 | 2.6348 | 2.6327 | 2.6308 | 2.6290 | 2.6275 | 2.6263 | 2.6254 | 2.6250 | 2.6253 | 2.6268 | 2.6309 | 2.6415 | 2.6723 | 2.7749 |
0.035 | 2.6506 | 2.6473 | 2.6442 | 2.6414 | 2.6387 | 2.6362 | 2.6339 | 2.6318 | 2.6300 | 2.6283 | 2.6269 | 2.6258 | 2.6251 | 2.6247 | 2.6249 | 2.6258 | 2.6286 | 2.6369 | 2.6644 | 2.7758 |
0.040 | 2.6510 | 2.6478 | 2.6448 | 2.6421 | 2.6396 | 2.6372 | 2.6349 | 2.6328 | 2.6310 | 2.6295 | 2.6281 | 2.6270 | 2.6262 | 2.6257 | 2.6257 | 2.6264 | 2.6287 | 2.6355 | 2.6598 | 2.7782 |
0.045 | 2.6527 | 2.6497 | 2.6467 | 2.6440 | 2.6414 | 2.6389 | 2.6368 | 2.6348 | 2.6330 | 2.6315 | 2.6302 | 2.6291 | 2.6284 | 2.6279 | 2.6277 | 2.6282 | 2.6300 | 2.6359 | 2.6577 | 2.7818 |
0.050 | 2.6554 | 2.6524 | 2.6494 | 2.6466 | 2.6440 | 2.6416 | 2.6395 | 2.6376 | 2.6358 | 2.6343 | 2.6331 | 2.6321 | 2.6313 | 2.6307 | 2.6305 | 2.6310 | 2.6327 | 2.6376 | 2.6572 | 2.7865 |
0.055 | 2.6591 | 2.6559 | 2.6530 | 2.6502 | 2.6476 | 2.6453 | 2.6431 | 2.6412 | 2.6394 | 2.6379 | 2.6367 | 2.6356 | 2.6348 | 2.6343 | 2.6343 | 2.6348 | 2.6362 | 2.6405 | 2.6579 | 2.7918 |
0.060 | 2.6635 | 2.6603 | 2.6574 | 2.6546 | 2.6519 | 2.6496 | 2.6475 | 2.6454 | 2.6436 | 2.6421 | 2.6408 | 2.6398 | 2.6390 | 2.6386 | 2.6386 | 2.6392 | 2.6407 | 2.6445 | 2.6598 | 2.7979 |
0.065 | 2.6685 | 2.6653 | 2.6622 | 2.6594 | 2.6568 | 2.6546 | 2.6524 | 2.6503 | 2.6485 | 2.6468 | 2.6454 | 2.6444 | 2.6436 | 2.6433 | 2.6435 | 2.6441 | 2.6456 | 2.6492 | 2.6628 | 2.8044 |
0.070 | 2.6740 | 2.6707 | 2.6676 | 2.6647 | 2.6621 | 2.6598 | 2.6576 | 2.6555 | 2.6536 | 2.6518 | 2.6504 | 2.6493 | 2.6487 | 2.6484 | 2.6487 | 2.6493 | 2.6509 | 2.6545 | 2.6668 | 2.8113 |
0.075 | 2.6799 | 2.6766 | 2.6735 | 2.6706 | 2.6679 | 2.6655 | 2.6631 | 2.6609 | 2.6589 | 2.6572 | 2.6559 | 2.6549 | 2.6542 | 2.6540 | 2.6542 | 2.6550 | 2.6569 | 2.6605 | 2.6718 | 2.8187 |
0.080 | 2.6862 | 2.6828 | 2.6797 | 2.6769 | 2.6741 | 2.6715 | 2.6691 | 2.6669 | 2.6648 | 2.6632 | 2.6620 | 2.6610 | 2.6603 | 2.6600 | 2.6603 | 2.6613 | 2.6633 | 2.6671 | 2.6775 | 2.8264 |
0.085 | 2.6929 | 2.6895 | 2.6864 | 2.6833 | 2.6805 | 2.6779 | 2.6757 | 2.6734 | 2.6713 | 2.6698 | 2.6684 | 2.6674 | 2.6667 | 2.6666 | 2.6670 | 2.6680 | 2.6701 | 2.6738 | 2.6838 | 2.8345 |
0.090 | 2.6999 | 2.6965 | 2.6932 | 2.6902 | 2.6874 | 2.6849 | 2.6825 | 2.6803 | 2.6783 | 2.6767 | 2.6754 | 2.6743 | 2.6737 | 2.6736 | 2.6740 | 2.6751 | 2.6772 | 2.6809 | 2.6906 | 2.8429 |
0.095 | 2.7072 | 2.7037 | 2.7005 | 2.6974 | 2.6946 | 2.6920 | 2.6896 | 2.6875 | 2.6856 | 2.6839 | 2.6826 | 2.6816 | 2.6811 | 2.6811 | 2.6815 | 2.6826 | 2.6847 | 2.6884 | 2.6977 | 2.8518 |
0.100 | 2.7147 | 2.7113 | 2.7080 | 2.7048 | 2.7020 | 2.6995 | 2.6971 | 2.6951 | 2.6933 | 2.6917 | 2.6903 | 2.6893 | 2.6889 | 2.6889 | 2.6895 | 2.6906 | 2.6926 | 2.6963 | 2.7051 | 2.8610 |
Appendix C Description of the ratings ELOb and ELOg
In this section we provide a brief description of the ratings ELOb and ELOg as defined by the authors of the ratings (Hvattum and Arntzen 2010).
C.1 Description of ELOb
Let



and the parameters c and d serve only to set a scale of the ratings. The authors suggest that we use c=10 and d=400 (but alternative values of c and d give identical rating systems). Assuming that the score for the home team follows:

Then the actual score for the away team is αA=1–αH. At the end of the match, the revised ELO rating for the home team is (the away team’s


with k=20 as a suitable parameter value.
C.2 Description of ELOg
The ELOg rating is a variant of ELOb above, in an attempt to also consider score difference, with the difference that k is replaced by the expression:

where δ is the absolute goal difference, and assuming k0>0 and λ>0 as fixed parameters; suggesting k0=10 and λ=1 as suitable parameter values.
Appendix D Optimised [k] and [k0, λ] values for the ratings ELOb and ELOg.

Optimised k0 and λ values for ELOg. Minimum squared error of expected goal difference observed when k0=2 and λ=2.8, where e=0.3405.

Optimised k-value for ELOb. Minimum squared error of expected goal difference observed when k=56, where e=0.3514.
- 1
It might also worth mentioning that the ELO rating algorithm was featured prominently in the popular movie The Social Network (also known as the Facebook movie), whereby during a scene Eduardo Saverin writes the mathematical formula for the ELO rating system on Zuckerberg’s dorm room window.
- 2
If the rating is applied to a single league competition, the average team in that league will have a rating of 0. If the rating is applied to more than one league in which adversaries between the different leagues (or cup competitions) play against each other, the average team over all leagues will have a rating of 0.
- 3
If the prediction is +4 in favour of the home side then an actual result of 5–0 will give you an error of approximately 1. But if the prediction is 0 in favour of the home side and the actual result is 1–0, then this also gives you the same error as above.
- 4
The learning parameters could have been optimised based on predictions of type {H, D, A} (corresponding to home win, draw and away win), based on profitability, based on scoring rules, or based on many other different accuracy measurements and metrics. We have chosen score difference for optimising the learning parameters since the pi-ratings themselves are exclusively determined by that information.
- 5
The first five EPL seasons (1992/1993 to 1996/1997) are solely considered for generating the initial ratings for the competing teams. This is important because training the model on ignorant team ratings (i.e., starting from 0) will negatively affect the training procedure. Thus, learning parameters λ and γ are trained during the subsequent ten seasons; 1997/1998 to 2006/2007 inclusive.
- 6
For the pi-rating system the ratings are segregated into intervals of 0.10 (from ≤–1.1 to >1.6), for ELOb the ratings are segregated into intervals of 25 (from ≤–330 to >345), and for ELOg the ratings are segregated into intervals of 35 (from ≤–475 to >470).
- 7
Assumes a profit margin of 5%.
- 8
For the newly promoted team Wolves the development of the ratings start at match instance 760 since no performances have been recorded relative to the residual EPL teams during the two preceding seasons.
- 9
Where the pi-ratings of the home and away team follow ∼Normal (x, y) distributions for capturing rating uncertainty, where x is the pi-rating value (RαH or RβA) and y is the pi-rating variance, which can be measured over n preceding match instances.
References
Baio, G., and M. Blangiardo. 2010. “Bayesian Hierarchical Model for the Prediction of Football Results.” Journal of Applied Statistics 37(2):253–264.10.1080/02664760802684177Search in Google Scholar
Buchner, A., W. Dubitzky, A. Schuster, P. Lopes, P. O’Doneghue, J. Hughes, D. A. Bell, K. Adamson, J. A. White, J. M. C. C. Anderson and M. D. Mulvenna. 1997. Corporate Evidential Decision Making in Performance Prediction Domains. Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI ’97). Providence, Rhode Island, USA: Brown University.Search in Google Scholar
Clarke, S. R. and J. M. Norman. 1995. “Home Ground Advantage of Individual Clubs in English Soccer.” The Statistician 44:509–521.10.2307/2348899Search in Google Scholar
Constantinou, A. C. and N. E. Fenton. 2012. Evidence of an (Intended) Inefficient Association Football Gambling Market.Under Review. Draft available at: http://constantinou.info/downloads/papers/evidenceofinefficiency.pdf.Search in Google Scholar
Constantinou, A. C., N. E. Fenton, and M. Neil. 2012a. “pi-football: A Bayesian Network Model for Forecasting Association Football Match Outcomes”. Knowledge-Based Systems, 322–339. Draft available at: http://www.constantinou.info/downloads/papers/pi-model11.pdf.10.1016/j.knosys.2012.07.008Search in Google Scholar
Constantinou, A. C., N. E. Fenton, and M. Neil. 2012b. Profiting from an Inefficient Association Football Gambling Market: Prediction, Risk and Uncertainty Using Bayesian Networks.Under Review. Draft available at: http://www.constantinou.info/downloads/papers/pi-model12.pdf.10.1016/j.knosys.2013.05.008Search in Google Scholar
Crowder, M., M. Dixon, A. Ledford and M. Robinson. 2002. “Dynamic Modelling and Prediction of English Football League Matches for Betting.” The Statistician 51:157–168.10.1111/1467-9884.00308Search in Google Scholar
Dixon, M., and S. Coles. 1997. “Modelling Association Football Scores and Inefficiencies in the Football Betting Market.” Applied Statistics 46:265–280.10.1111/1467-9876.00065Search in Google Scholar
Dixon, M., and P. Pope. 2004. “The Value of Statistical Forecasts in the UK Association Football Betting Market.” International Journal of Forecasting 20:697–711.10.1016/j.ijforecast.2003.12.007Search in Google Scholar
Dunning, E. 1999. Sport Matters: Sociological Studies of Sport, Violence and Civilisation. London: Routledge.Search in Google Scholar
Dunning, E. G., A Joseph and R.E. Maguire. 1993. The Sports Process: A Comparative and Developmental Approach. p. 129. Champaign: Human Kinetics.Search in Google Scholar
Elo, A. E. 1978. The Rating of Chess Players, Past and Present. New York: Arco Publishing.Search in Google Scholar
Fenton, N. E. and M. Neil. 2012. Risk Assessment and Decision Analysis with Bayesian Networks. London: Chapman and Hall.10.1201/9780367803018Search in Google Scholar
FIFA. 2012. FIFA. Retrieved March 27, 2012, from FIFA/Coca-Cola World Ranking Procedure: http://www.fifa.com/worldranking/procedureandschedule/menprocedure/index.html.Search in Google Scholar
Football-Data. 2012. Football-Data.co.uk. Retrieved August 2, 2012, from Football Results, Statistics & Soccer Betting Odds Data: http://www.football-data.co.uk/englandm.php.Search in Google Scholar
Forrest, D., J. Goddard and R. Simmons. 2005. “Odds-Setters as Forecasters: The Case of English Football.” International Journal of Forecasting 21:551–564.10.1016/j.ijforecast.2005.03.003Search in Google Scholar
Goddard, J. 2005. “Regression Models for Forecasting Goals and Match Results in Association Football.” International Journal of Forecasting 21:331–340.10.1016/j.ijforecast.2004.08.002Search in Google Scholar
Goddard, J. and I. Asimakopoulos. 2004. “Forecasting Football Results and the Efficiency of Fixed-odds Betting.” Journal of Forecasting 23:51–66.10.1002/for.877Search in Google Scholar
Halicioglu, F. 2005a. “Can We Predict the Outcome of the International Football Tournaments?: The Case of Euro 2000.” Doğuş Üniversitesi Dergisi 6:112–122.10.31671/dogus.2019.288Search in Google Scholar
Halicioglu, F. 2005b. Forecasting the Professional Team Sporting Events: Evidence from Euro 2000 and 2004 Football Tournaments. 5th International Conference on Sports and Culture: Economic, Management and Marketing Aspects. Athens, Greece, pp. 30–31.Search in Google Scholar
Harville, D. A. 1977. “The Use of Linear-model Methodology to Rate High School or College Football Teams.” Journal of American Statistical Association 72:278–289.10.1080/01621459.1977.10480991Search in Google Scholar
Hirotsu, N. and M. Wright. 2003. “An Evaluation of Characteristics of Teams in Association Football by Using a Markov Process Model.” The Statistician 52(4):591–602.10.1046/j.0039-0526.2003.00437.xSearch in Google Scholar
Hvattum, L. M. and H. Arntzen. 2010. “Using ELO Ratings for Match Result Prediction in Association Football.” International Journal of Forecasting 26:460–470.10.1016/j.ijforecast.2009.10.002Search in Google Scholar
Joseph, A., N. Fenton and M. Neil. 2006. “Predicting Football Results Using Bayesian Nets and Other Machine Learning Techniques.” Knowledge-Based Systems 7:544–553.10.1016/j.knosys.2006.04.011Search in Google Scholar
Karlis, D. and I. Ntzoufras. 2000. “On Modelling Soccer Data.” Student 229–244.Search in Google Scholar
Karlis, D. and I. Ntzoufras. 2003. “Analysis of Sports Data by Using Bivariate Poisson Models.” The Statistician 52(3): 381–393.10.1111/1467-9884.00366Search in Google Scholar
Knorr-Held, L. 1997. Hierarchical Modelling of Discrete Longitudinal Data, Applications of Markov Chain Monte Carlo. Munich: Utz.Search in Google Scholar
Knorr-Held, L. 2000. “Dynamic Rating of Sports Teams.” The Statistician 49(2):261–276.10.1111/1467-9884.00236Search in Google Scholar
Koning, R. 2000. “Balance in Competition in Dutch Soccer.” The Statistician 49(3):419–431.10.1111/1467-9884.00244Search in Google Scholar
Koning, R., H. Koolhaas, M. Renes and G. Ridder. 2003. “A Simulation Model for Football Championships.” European Journal of Operational Research 148:268–276.10.1016/S0377-2217(02)00683-5Search in Google Scholar
Kuonen, D. 1996. Statistical Models for Knock-Out Soccer Tournaments. Technical Report, Department of Mathematics, Ècole Polytechnique Federale de Lausanne.Search in Google Scholar
Kuypers, T. 2000. “Information and Efficiency: an Empirical Study of a Fixed Odds Betting Market.” Applied Economics 32: 1353–1363.10.1080/00036840050151449Search in Google Scholar
Lee, A. J. 1997. “Modeling Scores in the Premier League: is Manchester United Really the Best?” Chance 10:15–19.10.1080/09332480.1997.10554791Search in Google Scholar
Leitner, C., A. Zeileis and K. Hornik. 2010. “Forecasting Sports Tournaments by Ratings of (prob)abilities: A Comparison for the EURO 2008.” International Journal of Forecasting 26:471–481.10.1016/j.ijforecast.2009.10.001Search in Google Scholar
Maher, M. J. 1982. “Modelling Association Football Scores.” Statististica Neerlandica 36:109–118.10.1111/j.1467-9574.1982.tb00782.xSearch in Google Scholar
Min, B., J. Kim, C. Choe, H. Eom, and R. B. McKay. 2008. “A Compound Framework for Sports Results Prediction: A Football Case Study.” Knowledge-Based Systems 21:551–562.10.1016/j.knosys.2008.03.016Search in Google Scholar
Mueller, F. O., R. C. Cantu and S. P. Camp. 1996. Catastrophic Injuries in High School and College Sports. Champaign: Human Kinetics, p. 57.Search in Google Scholar
Murali, V. (2011, October 28) Bleacher Report. Retrieved March 28, 2012, from World Football: 40 Biggest Scandals in Football History: http://bleacherreport.com/articles/909932-world-football-40-biggest-scandals-in-football-history.Search in Google Scholar
Poulter, D. R. 2009. “Home Advantage and Player Nationality in International Club Football.” Journal of Sports Sciences 27(8):797–805.10.1080/02640410902893364Search in Google Scholar PubMed
Reid, D. A. and M. S. Nixon. 2011. “Using Comparative Human Descriptions for Soft Biometrics.” International Joint Conference on Biometrics (IJCB) 2011.10.1109/IJCB.2011.6117513Search in Google Scholar
Rotshtein, A., M. Posner and A. Rakytyanska. 2005. “Football Predictions Based on a Fuzzy Model with Genetic and Neural Tuning.” Cybernetics and Systems Analysis 41(4):619–630.10.1007/s10559-005-0098-4Search in Google Scholar
Rue, H. and O. Salvesen. 2000. “Prediction and Retrospective Analysis of Soccer Matches in a League.” The Statistician 3:339–418.10.1111/1467-9884.00243Search in Google Scholar
Tsakonas, A., G. Dounias, S. Shtovba and V. Vivdyuk. 2002. Soft Computing-Based Result Prediction of Football Games. The First International Conference on Inductive Modelling (ICIM 2002). Lviv, Ukraine.Search in Google Scholar
©2013 by Walter de Gruyter Berlin Boston