A method for clustering incomplete longitudinal data, and gene expression time course data in particular, is presented. Specifically, an existing method that utilizes mixtures of multivariate Gaussian distributions with modified Cholesky-decomposed covariance structure is extended to accommodate incomplete data. Parameter estimation is carried out in a fashion that is similar to an expectation-maximization algorithm. We focus on the particular application of clustering incomplete gene expression time course data. In this application, our approach gives good clustering performance when compared to the results when there is no missing data. Possible extensions of this work are also suggested.

Ed. by Hubbard, Alan E. / van der Laan, Mark J.
1 Issue per year
IMPACT FACTOR 2011: 1.284
Issues
Volume 7 (2011)
Volume 6 (2010)
Volume 5 (2009)
Volume 4 (2008)
Volume 3 (2007)
Volume 2 (2006)
Volume 1 (2005)
Most Downloaded Articles
- An Introduction to Causal Inference by Pearl, Judea
- Meta-Analysis of Observational Studies with Unmeasured Confounders by McCandless, Lawrence C.
- Accuracy of Conventional and Marginal Structural Cox Model Estimators: A Simulation Study by Xiao, Yongling/ Abrahamowicz, Michal and Moodie, Erica E. M.
- Evaluating treatment effectiveness in patient subgroups: a comparison of propensity score methods with an automated matching approach by Radice, Rosalba/ Ramsahai, Roland/ Grieve, Richard/ Kreif, Noemi/ Sadique, Zia and Sekhon, Jasjeet S.
- Special Issue on Causal Inference in Health Research by Moodie, Erica E. M./ Kaufman, Jay S. and Platt, Robert W.
A Pseudo-EM Algorithm for Clustering Incomplete Longitudinal Data
Mateen Shaikh / Paul D McNicholas / Anthony F Desmond
1University of Guelph
1University of Guelph
1University of Guelph
Citation Information: The International Journal of Biostatistics. Volume 6, Issue 1, Pages –, ISSN (Online) 1557-4679, DOI: 10.2202/1557-4679.1223, March 2010
Publication History:
- Published Online:
- 2010-03-15
Keywords: clustering; gene expression time course data; longitudinal data; missing data; mixture models; pseudo-EM


















Comments (0)