Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido

6 Issues per year


IMPACT FACTOR 2017: 0.812
5-year IMPACT FACTOR: 1.104

CiteScore 2017: 0.86

SCImago Journal Rank (SJR) 2017: 0.456
Source Normalized Impact per Paper (SNIP) 2017: 0.527

Mathematical Citation Quotient (MCQ) 2017: 0.04

Online
ISSN
1544-6115
See all formats and pricing
More options …
Volume 10, Issue 1

Issues

Volume 10 (2011)

Volume 9 (2010)

Volume 6 (2007)

Volume 5 (2006)

Volume 4 (2005)

Volume 2 (2003)

Volume 1 (2002)

High-Dimensional Regression and Variable Selection Using CAR Scores

Verena Zuber / Korbinian Strimmer
Published Online: 2011-07-18 | DOI: https://doi.org/10.2202/1544-6115.1730

Variable selection is a difficult problem that is particularly challenging in the analysis of high-dimensional genomic data. Here, we introduce the CAR score, a novel and highly effective criterion for variable ranking in linear regression based on Mahalanobis-decorrelation of the explanatory variables. The CAR score provides a canonical ordering that encourages grouping of correlated predictors and down-weights antagonistic variables. It decomposes the proportion of variance explained and it is an intermediate between marginal correlation and the standardized regression coefficient. As a population quantity, any preferred inference scheme can be applied for its estimation. Using simulations, we demonstrate that variable selection by CAR scores is very effective and yields prediction errors and true and false positive rates that compare favorably with modern regression techniques such as elastic net and boosting. We illustrate our approach by analyzing data concerned with diabetes progression and with the effect of aging on gene expression in the human brain. The R package “care” implementing CAR score regression is available from CRAN.

Keywords: variable importance; variable selection; decorrelation; lasso; elastic net; boosting; CAR score

About the article

Published Online: 2011-07-18


Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 10, Issue 1, ISSN (Online) 1544-6115, ISSN (Print) 2194-6302, DOI: https://doi.org/10.2202/1544-6115.1730.

Export Citation

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.Get Permission

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

[2]
Saffron A. G. Willis-Owen, Anna Thompson, Paul R. Kemp, Michael I. Polkey, William O. C. M. Cookson, Miriam F. Moffatt, and Samantha A. Natanek
Scientific Reports, 2018, Volume 8, Number 1
[3]
Kirsten M de Beurs, Geoffrey M Henebry, Braden C Owsley, and Irina N Sokolik
Environmental Research Letters, 2018, Volume 13, Number 6, Page 065018
[4]
Dominik Seidel, Peter Annighöfer, Martin Ehbrecht, Christian Ammer, and Peter Schall
Ecological Indicators, 2018, Volume 93, Page 243
[5]
Chris J. Law, Colleen Young, and Rita S. Mehta
Physiological and Biochemical Zoology, 2016, Volume 89, Number 5, Page 347
[6]
Raquel Iniesta, Karen Hodgson, Daniel Stahl, Karim Malki, Wolfgang Maier, Marcella Rietschel, Ole Mors, Joanna Hauser, Neven Henigsberg, Mojca Zvezdana Dernovsek, Daniel Souery, Richard Dobson, Katherine J. Aitchison, Anne Farmer, Peter McGuffin, Cathryn M. Lewis, and Rudolf Uher
Scientific Reports, 2018, Volume 8, Number 1
[7]
S M Castro, G A Sanchez-Azofeifa, and H Sato
Environmental Research Letters, 2018, Volume 13, Number 4, Page 045001
[8]
Ulrike Grömping
Wiley Interdisciplinary Reviews: Computational Statistics, 2015, Volume 7, Number 2, Page 137
[9]
R. Kyle Bocinsky and Timothy A. Kohler
Nature Communications, 2014, Volume 5, Page 5618
[10]
Robin Andersson, Claudia Gebhard, Irene Miguel-Escalada, Ilka Hoof, Jette Bornholdt, Mette Boyd, Yun Chen, Xiaobei Zhao, Christian Schmidl, Takahiro Suzuki, Evgenia Ntini, Erik Arner, Eivind Valen, Kang Li, Lucia Schwarzfischer, Dagmar Glatz, Johanna Raithel, Berit Lilje, Nicolas Rapin, Frederik Otzen Bagger, Mette Jørgensen, Peter Refsing Andersen, Nicolas Bertin, Owen Rackham, A. Maxwell Burroughs, J. Kenneth Baillie, Yuri Ishizu, Yuri Shimizu, Erina Furuhata, Shiori Maeda, Yutaka Negishi, Christopher J. Mungall, Terrence F. Meehan, Timo Lassmann, Masayoshi Itoh, Hideya Kawaji, Naoto Kondo, Jun Kawai, Andreas Lennartsson, Carsten O. Daub, Peter Heutink, David A. Hume, Torben Heick Jensen, Harukazu Suzuki, Yoshihide Hayashizaki, Ferenc Müller, The FANTOM Consortium, Alistair R. R. Forrest, Piero Carninci, Michael Rehli, and Albin Sandelin
Nature, 2014, Volume 507, Number 7493, Page 455
[11]
Charles E. Vejnar and Evgeny M. Zdobnov
Nucleic Acids Research, 2012, Volume 40, Number 22, Page 11673
[12]
Jan Kalina
Biocybernetics and Biomedical Engineering, 2014, Volume 34, Number 1, Page 10
[13]
Agnan Kessy, Alex Lewin, and Korbinian Strimmer
The American Statistician, 2017, Page 0
[14]
Léa Maitre, Cristina M. Villanueva, Matthew R. Lewis, Jesús Ibarluzea, Loreto Santa-Marina, Martine Vrijheid, Jordi Sunyer, Muireann Coen, and Mireille B. Toledano
BMC Medicine, 2016, Volume 14, Number 1
[15]
Melanie Ganz, Douglas N. Greve, Bruce Fischl, and Ender Konukoglu
NeuroImage, 2015, Volume 122, Page 131
[16]
Pengfei Wei, Zhenzhou Lu, and Jingwen Song
Reliability Engineering & System Safety, 2015, Volume 142, Page 399
[17]
Mathias Kirchner, Martin Schönhart, and Erwin Schmid
Ecological Economics, 2016, Volume 123, Page 35
[19]
Pengfei Wei, Yanyan Wang, and Chenghu Tang
Structural and Multidisciplinary Optimization, 2017, Volume 55, Number 5, Page 1883
[20]
S. Ejaz Ahmed and Bahadır Yüzbaşı
International Journal of Management Science and Engineering Management, 2016, Volume 11, Number 2, Page 105
[21]
Holger Kirsten, Hoor Al-Hasani, Lesca Holdt, Arnd Gross, Frank Beutner, Knut Krohn, Katrin Horn, Peter Ahnert, Ralph Burkhardt, Kristin Reiche, Jörg Hackermüller, Markus Löffler, Daniel Teupser, Joachim Thiery, and Markus Scholz
Human Molecular Genetics, 2015, Volume 24, Number 16, Page 4746
[22]
M. Siwek, A. Slawinska, M. Rydzanicz, J. Wesoly, M. Fraszczak, T. Suchocki, J. Skiba, K. Skiba, and J. Szyda
Animal Genetics, 2015, Volume 46, Number 3, Page 247
[23]
Alexander Benedikt Leichtle, Uta Ceglarek, Peter Weinert, Christos T. Nakas, Jean-Marc Nuoffer, Julia Kase, Tim Conrad, Helmut Witzigmann, Joachim Thiery, and Georg Martin Fiedler
Metabolomics, 2013, Volume 9, Number 3, Page 677
[24]
Bingqing Lin and Zhen Pang
Journal of Computational and Graphical Statistics, 2014, Volume 23, Number 2, Page 478
[25]
Frank Niemeyer, Hans-Joachim Wilke, and Hendrik Schmidt
Journal of Biomechanics, 2012, Volume 45, Number 8, Page 1414
[26]
Tasadduq Imam, Kevin Tickle, Abdullahi Ahmed, and William Guo
Intelligent Systems in Accounting, Finance and Management, 2012, Volume 19, Number 1, Page 19

Comments (0)

Please log in or register to comment.
Log in