Statistical inference of graphical models has become an important tool in the reconstruction of biological networks, such as those that model gene regulatory interactions. In particular, constructing a score-based Bayesian posterior density over the space of models provides an intuitive and computationally feasible method of assessing model uncertainty and of assigning statistical confidence to structural features. A problem that frequently occurs with this approach is a tendency to overestimate the degree of model complexity. Spurious graphical features obtained in this way may affect the inference in unpredictable ways, even when using scoring techniques, such as the Bayesian Information Criterion (BIC), that are specifically designed to compensate for overfitting.

In this article we propose a simple adjustment to a BIC-based scoring procedure. The method proceeds in two steps. In the first step we derive an independent estimate of the parametric complexity of the model. In the second step we modify the BIC score so that the mean parametric complexity of the posterior density is equal to the estimated value. The method is applied to a set of test networks and to a collection of genes from the yeast genome known to possess regulatory relationships. A Bayesian network model with binary responses is employed. In the examples considered, we find that the number of spurious graph edges inferred is reduced, while the effect on the identification of true edges is minimal.
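The second step of the adjustment can be illustrated with a minimal sketch. The abstract does not give the form of the modified score or the complexity estimator, so everything here is an assumption made for illustration: the penalty is rescaled by a hypothetical multiplier `lam`, the posterior is taken over a small enumerated list of candidate models with made-up log-likelihoods and parameter counts, and the target complexity `target_k` is simply supplied by hand in place of the independent estimate from the first step.

```python
import math

def posterior_weights(logliks, ks, n, lam):
    """Posterior weights proportional to exp(loglik - lam * (k/2) * log n).

    lam = 1 corresponds to the usual BIC penalty; lam is a hypothetical
    tuning multiplier, not a quantity defined in the article.
    """
    scores = [ll - lam * 0.5 * k * math.log(n) for ll, k in zip(logliks, ks)]
    top = max(scores)  # subtract the maximum for numerical stability
    w = [math.exp(s - top) for s in scores]
    z = sum(w)
    return [x / z for x in w]

def mean_complexity(logliks, ks, n, lam):
    """Posterior mean of the parameter count k under penalty multiplier lam."""
    w = posterior_weights(logliks, ks, n, lam)
    return sum(wi * k for wi, k in zip(w, ks))

def calibrate_penalty(logliks, ks, n, target_k, lo=0.0, hi=50.0, iters=100):
    """Bisection for lam such that the posterior mean complexity equals target_k.

    The mean complexity is non-increasing in lam, so bisection applies
    whenever target_k lies between the values attained at lo and hi.
    """
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mean_complexity(logliks, ks, n, mid) > target_k:
            lo = mid  # models still too complex on average: penalize more
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Toy candidate models (illustrative numbers only, not data from the article).
logliks = [-120.0, -110.0, -106.0, -104.5]  # maximized log-likelihoods
ks = [2, 4, 6, 8]                           # parameter counts
n = 100                                     # sample size
target_k = 4.2                              # stand-in for the step-one estimate

lam = calibrate_penalty(logliks, ks, n, target_k)
```

In this toy setting the unadjusted BIC posterior has mean complexity above the target, so the calibrated multiplier comes out larger than 1, which shifts posterior mass toward sparser models. In a real network problem the model space cannot be enumerated, and the posterior mean would have to be approximated, for example by sampling.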

Using Complexity for the Estimation of Bayesian Networks
University of Rochester
Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 5, Issue 1, Pages –, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1208, August 2006
Published Online: 2006-08-31
Keywords: gene expression data; gene regulatory network; directed acyclic graph; model selection; Bayes information criteria

















