Publication Date: September 2007
ISSN: 1544-6115
DOI: 10.2202/1544-6115.1309



Super Learner

Mark J. van der Laan / Eric C Polley / Alan E. Hubbard

University of California, Berkeley

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 6, Issue 1, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1309, September 2007

Publication History: Published Online: 2007-09-16

When trying to learn a model for the prediction of an outcome given a set of covariates, a statistician has many estimation procedures in their toolbox. A few examples of these candidate learners are least squares, least angle regression, random forests, and spline regression. Previous articles (van der Laan and Dudoit, 2003; van der Laan et al., 2006; Sinisi et al., 2007) theoretically validated the use of cross-validation to select an optimal learner among many candidate learners. Motivated by this use of cross-validation, we propose a new prediction method that creates a weighted combination of many candidate learners to build the super learner. This article proposes a fast algorithm for constructing a super learner in prediction, which uses V-fold cross-validation to select the weights that combine an initial set of candidate learners. In addition, this paper contains a practical demonstration of the adaptivity of this so-called super learner to various true data-generating distributions. This approach to constructing a super learner generalizes to any parameter that can be defined as a minimizer of a loss function.

Keywords: cross-validation; loss-based estimation; machine learning; prediction
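
The idea in the abstract, using V-fold cross-validation to choose weights for combining candidate learners, can be illustrated with a minimal sketch. This is not the authors' implementation. The three candidate learners, the function names (`fit_mean`, `fit_linear`, `fit_quadratic`, `super_learner`), the simulated data, and the choice of unconstrained least squares on the cross-validated predictions to obtain the weights are all assumptions made for illustration under squared-error loss.

```python
import numpy as np


def fit_mean(X, y):
    """Hypothetical candidate: always predicts the training mean."""
    mu = float(np.mean(y))
    return lambda X_new: np.full(X_new.shape[0], mu)


def fit_linear(X, y):
    """Hypothetical candidate: least squares on intercept + covariates."""
    expand = lambda A: np.column_stack([np.ones(A.shape[0]), A])
    beta, *_ = np.linalg.lstsq(expand(X), y, rcond=None)
    return lambda X_new: expand(X_new) @ beta


def fit_quadratic(X, y):
    """Hypothetical candidate: least squares with squared covariates added."""
    expand = lambda A: np.column_stack([np.ones(A.shape[0]), A, A ** 2])
    beta, *_ = np.linalg.lstsq(expand(X), y, rcond=None)
    return lambda X_new: expand(X_new) @ beta


def super_learner(X, y, learners, n_folds=5, seed=0):
    """Weight candidate learners using V-fold cross-validated predictions."""
    n = X.shape[0]
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n), n_folds)

    # Z[i, k]: prediction for observation i from learner k,
    # trained on the folds that do not contain observation i.
    Z = np.zeros((n, len(learners)))
    for held_out in folds:
        train = np.setdiff1d(np.arange(n), held_out)
        for k, fit in enumerate(learners):
            Z[held_out, k] = fit(X[train], y[train])(X[held_out])

    # Assumption: weights minimise squared error of y regressed on Z,
    # with no constraints on the weights.
    weights, *_ = np.linalg.lstsq(Z, y, rcond=None)

    # Refit each candidate on all the data, then combine with the weights.
    fitted = [fit(X, y) for fit in learners]

    def predict(X_new):
        return np.column_stack([f(X_new) for f in fitted]) @ weights

    return predict, weights


# Simulated example (assumed data-generating distribution: y = x^2 + noise).
data_rng = np.random.default_rng(1)
X = data_rng.uniform(0, 3, size=(200, 1))
y = X[:, 0] ** 2 + data_rng.normal(scale=0.1, size=200)
predict, weights = super_learner(X, y, [fit_mean, fit_linear, fit_quadratic])
print(weights)
```

Because the weights are estimated from predictions made on held-out folds, a candidate that only fits its own training data well gains no advantage. Constraining the weights (for example, to be non-negative) is a common variation that this sketch does not attempt.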
