Jump to ContentJump to Main Navigation
Show Summary Details

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Stumpf, Michael P.H.


IMPACT FACTOR increased in 2015: 1.265
5-year IMPACT FACTOR: 1.423
Rank 42 out of 123 in category Statistics & Probability in the 2015 Thomson Reuters Journal Citation Report/Science Edition

SCImago Journal Rank (SJR) 2015: 0.954
Source Normalized Impact per Paper (SNIP) 2015: 0.554
Impact per Publication (IPP) 2015: 1.061

Mathematical Citation Quotient (MCQ) 2015: 0.06

99,00 € / $149.00 / £75.00*

Online
ISSN
1544-6115
See all formats and pricing

 


Select Volume and Issue
Loading journal volume and issue information...

Informative or Noninformative Calls for Gene Expression: A Latent Variable Approach

Adetayo Kasim1 / Dan Lin2 / Suzy Van Sanden3 / Djork-Arné Clevert4 / Luc Bijnens5 / Hinrich Göhlmann6 / Dhammika Amaratunga7 / Sepp Hochreiter8 / Ziv Shkedy9 / Willem Talloen10

1Universiteit Hasselt & Katholieke Universiteit Leuven

2Universiteit Hasselt & Katholieke Universiteit Leuven

3Universiteit Hasselt & Katholieke Universiteit Leuven

4Johannes Kepler University Linz & Charité - Universitätsmedizin Berlin

5Janssen Pharmaceutica N. V., Beerse

6Janssen Pharmaceutica N. V., Beerse

7Johnson & Johnson Pharmaceutical Research & Development, Raritan

8Johannes Kepler University Linz

9Universiteit Hasselt & Katholieke Universiteit Leuven

10Janssen Pharmaceutica N. V., Beerse

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 9, Issue 1, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1460, January 2010

Publication History

Published Online:
2010-01-06

The strength and weakness of microarray technology can be attributed to the enormous amount of information it is generating. To fully enhance the benefit of microarray technology for testing differentially expressed genes and classification, there is a need to minimize the amount of irrelevant genes present in microarray data. A major interest is to use probe-level data to call genes informative or noninformative based on the trade-off between the array-to-array variability and the measurement error. Existing works in this direction include filtering likely uninformative sets of hybridization (FLUSH; Calza et al., 2007) and I/NI calls for the exclusion of noninformative genes using FARMS (I/NI calls; Talloen et al., 2007; Hochreiter et al., 2006). In this paper, we propose a linear mixed model as a more flexible method that performs equally good as I/NI calls and outperforms FLUSH. We also introduce other criteria for gene filtering, such as, R2 and intra-cluster correlation. Additionally, we include some objective criteria based on likelihood ratio testing, the Akaike information criteria (AIC; Akaike, 1973) and the Bayesian information criterion (BIC; Schwarz, 1978 ).Based on the HGU-133A Spiked-in data set, it is shown that the linear mixed model approach outperforms FLUSH, a method that filters genes based on a quantile regression. The linear model is equivalent to a factor analysis model when either the factor loadings are set to a constant with the variance of the latent factor equal to one, or if the factor loadings are set to one together with unconstrained variance of the latent factor. Filtering based on conditional variance calls a probe set informative when the intensity of one or more probes is consistent across the arrays, while filtering using R2 or intra-cluster correlation calls a probe set informative only when average intensity of a probe set is consistent across the arrays. Filtering based on likelihood ratio test AIC and BIC are less stringent compared to the other criteria.

Keywords: gene filtering; factor analysis; linear mixed model

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

[1]
Raffaele Fronza, Michele Tramonti, William R Atchley, and Christine Nardini
BMC Bioinformatics, 2011, Volume 12, Number 1, Page 86

Comments (0)

Please log in or register to comment.