Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido


IMPACT FACTOR 2018: 0.536
5-year IMPACT FACTOR: 0.764

CiteScore 2018: 0.49

SCImago Journal Rank (SJR) 2018: 0.316
Source Normalized Impact per Paper (SNIP) 2018: 0.342

Mathematical Citation Quotient (MCQ) 2017: 0.04

Online
ISSN
1544-6115
See all formats and pricing
More options …
Volume 3, Issue 1

Issues

Volume 10 (2011)

Volume 9 (2010)

Volume 6 (2007)

Volume 5 (2006)

Volume 4 (2005)

Volume 2 (2003)

Volume 1 (2002)

Statistical Significance Threshold Criteria For Analysis of Microarray Gene Expression Data

Cheng Cheng / Stanley B. Pounds / James M. Boyett / Deqing Pei / Mei-Ling Kuo / Martine F. Roussel
Published Online: 2004-12-19 | DOI: https://doi.org/10.2202/1544-6115.1064

The methodological advancement in microarray data analysis on the basis of false discovery rate (FDR) control, such as the q-value plots, allows the investigator to examine the FDR from several perspectives. However, when FDR control at the “customary" levels 0.01, 0.05, or 0.1 does not provide fruitful findings, there is little guidance for making the trade off between the significance threshold and the FDR level by sound statistical or biological considerations. Thus, meaningful statistical significance criteria that complement the existing FDR methods for large-scale multiple tests are desirable. Three statistical significance criteria, the profile information criterion, the total error proportion, and the guide-gene driven selection, are developed in this research. The first two are general significance threshold criteria for large-scale multiple tests; the profile information criterion is related to the recent theoretical studies of the connection between FDR control and minimax estimation, and the total error proportion is closely related to the asymptotic properties of FDR control in terms of the total error risk. The guide-gene driven selection is an approach to combining statistical significance and the existing biological knowledge of the study at hand. Error properties of these criteria are investigated theoretically and by simulation. The proposed methods are illustrated and compared using an example of genomic screening for novel Arf gene targets. Operating characteristics of q-value and the proposed significance threshold criteria are investigated and compared in a simulation study that employs a model mimicking a gene regulatory pathway. A guideline for using these criteria is provided. Splus/R code is available from the corresponding author upon request.

Keywords: multiple tests; significance threshold selection; profile information criterion; total error proportion; false discovery rate; q-value; microarray; gene expression

About the article

Published Online: 2004-12-19


Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 3, Issue 1, Pages 1–30, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1064.

Export Citation

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.Get Permission

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

[1]
Kristin Shrader-Frechette
Biological Theory, 2008, Volume 3, Number 1, Page 12
[2]
Hiroto Inaba, Xueyuan Cao, Alice Q. Han, John C. Panetta, Kirsten K. Ness, Monika L. Metzger, Jeffrey E. Rubnitz, Raul C. Ribeiro, John T. Sandlund, Sima Jeha, Cheng Cheng, Ching-Hon Pui, Mary V. Relling, and Sue C. Kaste
Cancer, 2017
[3]
Jiewu Liu, Qian Huang, Jason Higdon, Wei Liu, Tao Xie, Tetsuji Yamashita, Kyeogmi Cheon, Cheng Cheng, and Jian Zuo
Human Molecular Genetics, 2005, Volume 14, Number 19, Page 2945
[4]
James J. Chen, Paula K. Robeson, and Michael J. Schell
Cancer Control, 2010, Volume 17, Number 1, Page 58
[5]
Daniel L. Hunt, Cheng Cheng, and Stanley Pounds
Computational Statistics & Data Analysis, 2009, Volume 53, Number 5, Page 1688
[6]
Kathleen F. Kerr
Bioinformatics, 2009, Volume 25, Number 16, Page 2035
[7]
Laura B. Ramsey, Stan Pounds, Cheng Cheng, Xueyuan Cao, Wenjian Yang, Colton Smith, Seth E. Karol, Chengcheng Liu, John C. Panetta, Hiroto Inaba, Jeffrey E. Rubnitz, Monika L. Metzger, Raul C. Ribeiro, John T. Sandlund, Sima Jeha, Ching-Hon Pui, William E. Evans, and Mary V. Relling
Pharmacogenetics and Genomics, 2017, Volume 27, Number 8, Page 294
[8]
S E Karol, E Larsen, C Cheng, X Cao, W Yang, L B Ramsey, C A Fernandez, J R McCorkle, S W Paugh, R J Autry, E Lopez-Lopez, B Diouf, S Jeha, C-H Pui, E A Raetz, N J Winick, W L Carroll, S P Hunger, M L Loh, M Devidas, W E Evans, J J Yang, and M V Relling
Leukemia, 2017, Volume 31, Number 6, Page 1325
[9]
Shu-Dong Zhang and Ying Xu
PLoS ONE, 2011, Volume 6, Number 4, Page e18874
[10]
J. X. Rong, Y. Qiu, M. K. Hansen, L. Zhu, V. Zhang, M. Xie, Y. Okamoto, M. D. Mattie, H. Higashiyama, S. Asano, J. C. Strum, and T. E. Ryan
Diabetes, 2007, Volume 56, Number 7, Page 1751
[11]
Cheng Cheng
Computational Statistics & Data Analysis, 2009, Volume 53, Number 3, Page 788
[12]
Xinping Cui, Haibing Zhao, and Jason Wilson
Journal of Biopharmaceutical Statistics, 2010, Volume 20, Number 2, Page 223

Comments (0)

Please log in or register to comment.
Log in