Jump to ContentJump to Main Navigation

Online

* Prices subject to change. Shipping costs will be added.
Publication Date:
8 11 2011
ISSN:
1544-6115
DOI:
10.2202/1544-6115.1732

Editor-in-Chief: Jewell, Nicholas P. / Churchill, Gary A. / Thompson, Elizabeth A.

IMPACT FACTOR 2010: 1.842
5-year IMPACT FACTOR: 2.182
Rank 14 out of 110 in category Statistics & Probability in the 2010 Thomson Reuters Journal Citation Report/Science Edition

Modeling Read Counts for CNV Detection in Exome Sequencing Data

Love, Michael I. / Myšičková, Alena / Sun, Ruping / Kalscheuer, Vera / Vingron, Martin / Haas, Stefan A.

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 10, Issue 1, Pages –, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1732, November 2011

Publication History: Published Online: 10/04/2012

Varying depth of high-throughput sequencing reads along a chromosome makes it possible to observe copy number variants (CNVs) in a sample relative to a reference. In exome and other targeted sequencing projects, technical factors increase variation in read depth while reducing the number of observed locations, adding difficulty to the problem of identifying CNVs. We present a hidden Markov model for detecting CNVs from raw read count data, using background read depth from a control set as well as other positional covariates such as GC-content. The model, exomeCopy, is applied to a large chromosome X exome sequencing project identifying a list of large unique CNVs. CNVs predicted by the model and experimentally validated are then recovered using a cross-platform control set from publicly available exome sequencing data. Simulations show high sensitivity for detecting heterozygous and homozygous CNVs, outperforming normalization and state-of-the-art segmentation methods.

Keywords: exorne sequencing; targeted sequencing; CNV; copy number variant; HMM; hidden Markov model

Comments (0)

Please log in or register to comment.