Jump to ContentJump to Main Navigation

Online

99,00 € / $149.00*

* Prices subject to change. Shipping costs will be added if applicable.
Publication Date:
April 2012
ISSN:
1544-6115
DOI:
10.1515/1544-6115.1766

See all formats and pricing

Online
Individual Subscription Online only
Euro [D] 99.00
RRP for USA, Canada, Mexico
US$ 149.00 *
Print
Individual Subscription Online only
Euro [D] 285.00
RRP for USA, Canada, Mexico
US$ 384.00 *
Print + Online
Individual Subscription Online only
Euro [D] 342.00
RRP for USA, Canada, Mexico
US$ 461.00 *
*Prices subject to change. Shipping costs will be added if applicable.

Editor-in-Chief: Stumpf, Michael P.H.

Editorial Board Member: Beaumont, Mark / Binder, Harald / Gupta, Mayetri / Hubbard, Alan E. / Husmeier, Dirk / Ji, Hongkai / Keles, Sunduz / Kerr, Kathleen / Lazzeroni, Laura / Lin, Shili / Ma, Ping / Marjoram, Paul / Mertens, Bart / Nerman, Olle / G. Petretto, Enrico / Plagnol, Vincent / Purdom, Elizabeth / Robin, Stéphane / Rzhetsky, Andrey / Sanguinetti, Guido / van der Laan, Mark J. / von Haeseler, Arndt / Weeks, Daniel E. / Wiuf, Carsten / Zhao, Hongyu

6 Issues per year

IMPACT FACTOR 2011: 1.517
5-year IMPACT FACTOR: 1.704
Rank 27 out of 116 in category Statistics & Probability in the 2011 Thomson Reuters Journal Citation Report/Science Edition

The practical effect of batch on genomic prediction

Hilary S. Parker / Jeffrey T. Leek

1Johns Hopkins Bloomberg School of Public Health

1Johns Hopkins Bloomberg School of Public Health

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 11, Issue 3, Pages –, ISSN (Online) 1544-6115, DOI: 10.1515/1544-6115.1766, April 2012

Publication History:
Published Online:
2012-04-16

Measurements from microarrays and other high-throughput technologies are susceptible to non-biological artifacts like batch effects. It is known that batch effects can alter or obscure the set of significant results and biological conclusions in high-throughput studies. Here we examine the impact of batch effects on predictors built from genomic technologies. To investigate batch effects, we collected publicly available gene expression measurements with known outcomes, and estimated batches using date. Using these data we show (1) the impact of batch effects on prediction depends on the correlation between outcome and batch in the training data, and (2) removing expression measurements most affected by batch before building predictors may improve the accuracy of those predictors. These results suggest that (1) training sets should be designed to minimize correlation between batches and outcome, and (2) methods for identifying batch-affected probes should be developed to improve prediction results for studies with high correlation between batches and outcome.

Keywords: batch effects; prediction; microarrays; reproducibility; research design

Comments (0)

Please log in or register to comment.