Jump to ContentJump to Main Navigation
Show Summary Details
In This Section

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Stumpf, Michael P.H.

6 Issues per year


IMPACT FACTOR 2016: 0.646
5-year IMPACT FACTOR: 1.191

CiteScore 2016: 0.94

SCImago Journal Rank (SJR) 2015: 0.954
Source Normalized Impact per Paper (SNIP) 2015: 0.554

Mathematical Citation Quotient (MCQ) 2015: 0.06

Online
ISSN
1544-6115
See all formats and pricing
In This Section
Volume 7, Issue 1 (Nov 2008)

Issues

A Sparse PLS for Variable Selection when Integrating Omics Data

Kim-Anh Lê Cao
  • INRA UR 631 and Université de Toulouse
/ Debra Rossouw
  • University of Stellenbosch
/ Christèle Robert-Granié
  • INRA UR 631
/ Philippe Besse
  • Université de Toulouse
Published Online: 2008-11-18 | DOI: https://doi.org/10.2202/1544-6115.1390

Recent biotechnology advances allow for multiple types of omics data, such as transcriptomic, proteomic or metabolomic data sets to be integrated. The problem of feature selection has been addressed several times in the context of classification, but needs to be handled in a specific manner when integrating data. In this study, we focus on the integration of two-block data that are measured on the same samples. Our goal is to combine integration and simultaneous variable selection of the two data sets in a one-step procedure using a Partial Least Squares regression (PLS) variant to facilitate the biologists' interpretation. A novel computational methodology called ``sparse PLS" is introduced for a predictive analysis to deal with these newly arisen problems. The sparsity of our approach is achieved with a Lasso penalization of the PLS loading vectors when computing the Singular Value Decomposition.Sparse PLS is shown to be effective and biologically meaningful. Comparisons with classical PLS are performed on a simulated data set and on real data sets. On one data set, a thorough biological interpretation of the obtained results is provided. We show that sparse PLS provides a valuable variable selection tool for highly dimensional data sets.

Keywords: joint analysis; two-block data set; multivariate regression; dimension reduction

About the article

Published Online: 2008-11-18



Citation Information: Statistical Applications in Genetics and Molecular Biology, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1390. Export Citation

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

[1]
Tahir Mehmood and Zahid Rasheed
Communications for Statistical Applications and Methods, 2015, Volume 22, Number 6, Page 575
[2]
B. Zimmerman, V. Tafintseva, M. Bağcıoğlu, M. Høegh Berdahl, and A. Kohler
Analytical Chemistry, 2015
[3]
Rosalba Calvini, Alessandro Ulrici, and Jose Manuel Amigo
Chemometrics and Intelligent Laboratory Systems, 2015
[4]
Tomasz Waller, Tomasz Gubała, Krzysztof Sarapata, Monika Piwowar, and Wiktor Jurkowski
BioData Mining, 2015, Volume 8, Number 1
[5]
Mingon Kang, Dong-Chul Kim, Chunyu Liu, and Jean Gao
BioMed Research International, 2015, Volume 2015, Page 1
[6]
Ewa Szymańska, Emma Brodrick, Mark Williams, Antony N. Davies, Henk-Jan van Manen, and Lutgarde M. C. Buydens
Analytical Chemistry, 2015, Volume 87, Number 2, Page 869
[7]
Mingkun Chen and Evelyne Vigneau
Advances in Data Analysis and Classification, 2014
[8]
İbrahim Karaman, Natalja P. Nørskov, Christian Clement Yde, Mette Skou Hedemann, Knud Erik Bach Knudsen, and Achim Kohler
Metabolomics, 2015, Volume 11, Number 2, Page 367
[9]
Julieta Fuentes, Pilar Poncela, and Julio Rodríguez
Journal of Applied Econometrics, 2015, Volume 30, Number 4, Page 576
[10]
C. Corbane, S. Alleaume, and M. Deshayes
International Journal of Remote Sensing, 2013, Volume 34, Number 21, Page 7625
[11]
Kristian Hovde Liland, Martin Høy, Harald Martens, and Solve Sæbø
Chemometrics and Intelligent Laboratory Systems, 2013, Volume 122, Page 103
[12]
İbrahim Karaman, El Mostafa Qannari, Harald Martens, Mette Skou Hedemann, Knud Erik Bach Knudsen, and Achim Kohler
Chemometrics and Intelligent Laboratory Systems, 2013, Volume 122, Page 65
[13]
Tahir Mehmood, Kristian Hovde Liland, Lars Snipen, and Solve Sæbø
Chemometrics and Intelligent Laboratory Systems, 2012, Volume 118, Page 62
[14]
Athanassios Kondylis and Joe Whittaker
Chemometrics and Intelligent Laboratory Systems, 2010, Volume 104, Number 2, Page 205
[15]
Cinnamon S. Bloss, Kelly M. Schiabor, and Nicholas J. Schork
Brain Research Bulletin, 2010, Volume 83, Number 3-4, Page 177
[16]
Maria Vounou, Thomas E. Nichols, and Giovanni Montana
NeuroImage, 2010, Volume 53, Number 3, Page 1147
[17]
P. Roy, C. Truntzer, D. Maucort-Boulch, T. Jouve, and N. Molinari
Briefings in Bioinformatics, 2011, Volume 12, Number 2, Page 176

Comments (0)

Please log in or register to comment.
Log in