Jump to ContentJump to Main Navigation

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Stumpf, Michael P.H.

6 Issues per year

IMPACT FACTOR 2013: 1.055
Rank 48 out of 119 in category Statistics & Probability in the 2013 Thomson Reuters Journal Citation Report/Science Edition

SCImago Journal Rank (SJR): 0.875
Source Normalized Impact per Paper (SNIP): 0.540


Multiple Imputation of Missing Phenotype Data for QTL Mapping

Jennifer F Bobb1 / Daniel O. Scharfstein2 / Michael J. Daniels3 / Francis S Collins4 / Samir Kelada5

1Johns Hopkins Bloomberg School of Public Health

2Johns Hopkins Bloomberg School of Public Health

3University of Florida

4National Human Genome Research Institute, National Institutes of Health

5National Human Genome Research Institute, National Institutes of Health

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 10, Issue 1, Pages 1–27, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1676, June 2011

Publication History

Published Online:

Missing phenotype data can be a major hurdle to mapping quantitative trait loci (QTL). Though in many cases experiments may be designed to minimize the occurrence of missing data, it is often unavoidable in practice; thus, statistical methods to account for missing data are needed. In this paper we describe an approach for conjoining multiple imputation and QTL mapping. Methods are applied to map genes associated with increased breathing effort in mice after lung inflammation due to allergen challenge in developing lines of the Collaborative Cross, a new mouse genetics resource. Missing data poses a particular challenge in this study because the desired phenotype summary to be mapped is a function of incompletely observed dose-response curves. Comparison of the multiple imputation approach to two naive approaches for handling missing data suggest that these simpler methods may yield poor results: ignoring missing data through a complete case analysis may lead to incorrect conclusions, while using a last observation carried forward procedure, which does not account for uncertainty in the imputed values, may lead to anti-conservative inference. The proposed approach is widely applicable to other studies with missing phenotype data.

Keywords: multiple imputation; missing data; quantitative trait loci

Comments (0)

Please log in or register to comment.