We predict the induction of Major League Baseball hitters and pitchers into the National Baseball Hall of Fame by the Baseball Writers Association of America. We employ a Random Forest algorithm for binary classification, improving upon past models with a simplistic input approach. Our results suggest that the random forest technique is a fruitful line of research with prediction in the sports world. We find an error rate as low as 0.91% in our most accurate forest, with no out-of-bag Error higher than 2.6% in any tree ensemble. We extend the results to an examination of the possibility of discrimination with respect to BBWAA voting, finding little evidence for exclusions based on race.
©2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston