Statistical Applications in Genetics and Molecular Biology
Editor-in-Chief: Stumpf, Michael P.H.
IMPACT FACTOR increased in 2015: 1.265
5-year IMPACT FACTOR: 1.423
Rank 42 out of 123 in category Statistics & Probability in the 2015 Thomson Reuters Journal Citation Report/Science Edition
SCImago Journal Rank (SJR) 2015: 0.954
Source Normalized Impact per Paper (SNIP) 2015: 0.554
Impact per Publication (IPP) 2015: 1.061
Mathematical Citation Quotient (MCQ) 2015: 0.06
Gene Filtering in the Analysis of Illumina Microarray Experiments
1Katholieke Universiteit Leuven and Universiteit Hasselt
2Katholieke Universiteit Leuven and Universiteit Hasselt
4Katholieke Universiteit Leuven and Universiteit Hasselt
5Katholieke Universiteit Leuven and Universiteit Hasselt
6Janssen Pharmaceutica N. V.
7Johnson & Johnson Pharmaceutical Research & Development
8Katholieke Universiteit Leuven and Universiteit Hasselt
Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 11, Issue 2, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1710, January 2012
- Published Online:
Illumina bead arrays are microarrays that contain a random number of technical replicates (beads) for every probe (bead type) within the same array. Typically around 30 beads are placed at random positions on the array surface, which opens unique opportunities for quality control. Most preprocessing methods for Illumina bead arrays are ported from the Affymetrix microarray platform and ignore the availability of the technical replicates. The large number of beads for a particular bead type on the same array, however, should be highly correlated, otherwise they just measure noise and can be removed from the downstream analysis. Hence, filtering bead types can be considered as an important step of the preprocessing procedure for Illumina platform. This paper proposes a filtering method for Illumina bead arrays, which builds upon the mixed model framework. Bead types are called informative/non-informative (I/NI) based on a trade-off between within and between array variabilities. The method is illustrated on a publicly available Illumina Spike-in data set (Dunning et al., 2008) and we also show that filtering results in a more powerful analysis of differentially expressed genes.