Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido


Volume 9, Issue 1

On the Optimal Design of Genetic Variant Discovery Studies

Iuliana Ionita-Laza / Nan M Laird
Published Online: 2010-08-27 | DOI: https://doi.org/10.2202/1544-6115.1581

The recent emergence of massively parallel sequencing technologies has enabled an increasing number of human genome re-sequencing studies, notable among them being the 1000 Genomes Project. The main aim of these studies is to identify the yet unknown genetic variants in a genomic region, mostly low frequency variants (frequency less than 5%). We propose here a set of statistical tools that address how to optimally design such studies in order to increase the number of genetic variants we expect to discover. Within this framework, the tradeoff between lower coverage for more individuals and higher coverage for fewer individuals can be naturally solved. The methods here are also useful for estimating the number of genetic variants missed in a discovery study performed at low coverage. We show applications to simulated data based on coalescent models and to sequence data from the ENCODE project. In particular, we show the extent to which combining data from multiple populations in a discovery study may increase the number of genetic variants identified relative to studies on single populations.
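
To make the coverage tradeoff described in the abstract concrete, the following is a minimal toy sketch and not the authors' method. It assumes a fixed sequencing budget (number of individuals times mean coverage), Poisson-distributed read depth, diploid individuals in Hardy-Weinberg proportions, and a hypothetical calling rule that a variant is detected in an individual only if at least `min_reads` reads carry it. All function names, the candidate coverage grid, and the parameter values are assumptions introduced for illustration.

```python
import math


def poisson_tail(mean, k):
    """Return P(X >= k) for X ~ Poisson(mean)."""
    cdf = sum(math.exp(-mean) * mean**i / math.factorial(i) for i in range(k))
    return 1.0 - cdf


def discovery_prob(freq, n_individuals, coverage, min_reads=2):
    """Probability that a variant of population frequency `freq` is seen
    in at least one of `n_individuals` sequenced at mean depth `coverage`.

    Toy assumptions (not from the article): Hardy-Weinberg genotypes,
    Poisson read depth, and detection requires >= min_reads variant reads.
    """
    # At a heterozygous site about half of the reads carry the variant.
    d_het = poisson_tail(coverage / 2.0, min_reads)
    d_hom = poisson_tail(coverage, min_reads)
    # Per-individual probability of carrying AND detecting the variant.
    p_ind = 2 * freq * (1 - freq) * d_het + freq**2 * d_hom
    return 1.0 - (1.0 - p_ind) ** n_individuals


def best_design(freq, budget, coverages=(2, 4, 8, 16, 32), min_reads=2):
    """Among designs with n * coverage <= budget, return the tuple
    (discovery probability, n, coverage) with the highest probability."""
    designs = []
    for c in coverages:
        n = budget // c  # individuals affordable at this coverage
        if n >= 1:
            designs.append((discovery_prob(freq, n, c, min_reads), n, c))
    return max(designs)


if __name__ == "__main__":
    # Compare designs for a 1% variant under a hypothetical budget of 800.
    for c in (2, 4, 8, 16, 32):
        n = 800 // c
        print(c, n, round(discovery_prob(0.01, n, c), 3))
    print(best_design(0.01, 800))
```

Under this toy model, very low coverage wastes reads on individuals in whom the variant cannot be called, while very high coverage leaves too few individuals to sample rare carriers, so the preferred design sits between the two extremes and shifts with the variant frequency.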

Keywords: species problem; variant discovery studies; sequencing technologies

About the article

Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 9, Issue 1, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1581.

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.

Citing Articles

[1]
Fan Yang and Duncan C. Thomas
Human Heredity, 2011, Volume 71, Number 4, Page 209
[3]
Yun Li, Wei Chen, Eric Yi Liu, and Yi-Hui Zhou
Statistics in Biosciences, 2013, Volume 5, Number 1, Page 3
[4]
Iuliana Ionita-Laza, Michael H. Cho, and Nan M. Laird
Statistics in Biosciences, 2013, Volume 5, Number 1, Page 54
[5]
James Zou, Gregory Valiant, Paul Valiant, Konrad Karczewski, Siu On Chan, Kaitlin Samocha, Monkol Lek, Shamil Sunyaev, Mark Daly, and Daniel G. MacArthur
Nature Communications, 2016, Volume 7, Page 13293
[6]
Marco Battiston, Stefano Favaro, and Yee Whye Teh
Journal of the American Statistical Association, 2016, Page 0
[7]
Hua Zhou and Yiwen Zhang
Computational Statistics & Data Analysis, 2012, Volume 56, Number 12, Page 3909
