Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido


IMPACT FACTOR 2018: 0.536
5-year IMPACT FACTOR: 0.764

CiteScore 2018: 0.49

SCImago Journal Rank (SJR) 2018: 0.316
Source Normalized Impact per Paper (SNIP) 2018: 0.342

Mathematical Citation Quotient (MCQ) 2017: 0.04

Online
ISSN
1544-6115
See all formats and pricing
More options …
Volume 9, Issue 1

Issues

Volume 10 (2011)

Volume 9 (2010)

Volume 6 (2007)

Volume 5 (2006)

Volume 4 (2005)

Volume 2 (2003)

Volume 1 (2002)

Sparse Partial Least Squares Classification for High Dimensional Data

Dongjun Chung / Sunduz Keles
Published Online: 2010-03-03 | DOI: https://doi.org/10.2202/1544-6115.1492

Partial least squares (PLS) is a well known dimension reduction method which has been recently adapted for high dimensional classification problems in genome biology. We develop sparse versions of the recently proposed two PLS-based classification methods using sparse partial least squares (SPLS). These sparse versions aim to achieve variable selection and dimension reduction simultaneously. We consider both binary and multicategory classification. We provide analytical and simulation-based insights about the variable selection properties of these approaches and benchmark them on well known publicly available datasets that involve tumor classification with high dimensional gene expression data. We show that incorporation of SPLS into a generalized linear model (GLM) framework provides higher sensitivity in variable selection for multicategory classification with unbalanced sample sizes between classes. As the sample size increases, the two-stage approach provides comparable sensitivity with better specificity in variable selection. In binary classification and multicategory classification with balanced sample sizes, the two-stage approach provides comparable variable selection and prediction accuracy as the GLM version and is computationally more efficient.

Keywords: partial least squares; classification; variable selection; dimension reduction; two-stage PLS; iteratively re-weighted partial least squares; gene expression

About the article

Published Online: 2010-03-03


Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 9, Issue 1, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1492.

Export Citation

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.Get Permission

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

[1]
Paul Brinkman, Anke-Hilse Maitland-van der Zee, and Ariane H. Wagener
Current Opinion in Pulmonary Medicine, 2019, Volume 25, Number 1, Page 94
[2]
Fanny Orlhac, Pierre-Alexandre Mattei, Charles Bouveyron, and Nicholas Ayache
Journal of Chemometrics, 2018, Page e3097
[4]
Diletta Ami, Paolo Mereghetti, Manuela Leri, Sofia Giorgetti, Antonino Natalello, Silvia Maria Doglia, Massimo Stefani, and Monica Bucciantini
Scientific Reports, 2018, Volume 8, Number 1
[5]
Virissa Lenters, Roel Vermeulen, and Lützen Portengen
Occupational and Environmental Medicine, 2018, Volume 75, Number 7, Page 522
[6]
Yang Yang, Chao Huang, Xing Su, Jinglin Zhu, Xuetong Chen, Yingxue Fu, Zhenzhong Wang, Jun Zhou, Wei Xiao, Chunli Zheng, and Yonghua Wang
Journal of Functional Foods, 2018, Volume 47, Page 143
[7]
Jinyu Chen and Shihua Zhang
Frontiers in Genetics, 2018, Volume 9
[8]
Inge Svein Helland, Solve Saebø, Trygve Almøy, and Raju Rimal
Journal of Chemometrics, 2018, Page e3044
[9]
Marta B. Lopes, André Veríssimo, Eunice Carrasquinha, Sandra Casimiro, Niko Beerenwinkel, and Susana Vinga
BMC Bioinformatics, 2018, Volume 19, Number 1
[10]
Ran Wang, John A. Gamon, Anna K. Schweiger, Jeannine Cavender-Bares, Philip A. Townsend, Arthur I. Zygielbaum, and Shan Kothari
Remote Sensing of Environment, 2018, Volume 211, Page 218
[11]
Stéphane Mussard and Fattouma Souissi-Benrejab
Journal of Quantitative Economics, 2018
[12]
Xiaojian Yang, Fugui Yin, Yuhui Yang, Dion Lepp, Hai Yu, Zheng Ruan, Chengbo Yang, Yulong Yin, Yongqing Hou, Steve Leeson, and Joshua Gong
Scientific Reports, 2018, Volume 8, Number 1
[13]
Laura Franitza, Luca Nicolotti, Michael Granvogl, and Peter Schieberle
Journal of Agricultural and Food Chemistry, 2018
[14]
Deepshika Ramanan, Rowann Bowcutt, Soo Ching Lee, Mei San Tang, Zachary D. Kurtz, Yi Ding, Kenya Honda, William C. Gause, Martin J. Blaser, Richard A. Bonneau, Yvonne A.L. Lim, P’ng Loke, and Ken Cadwell
Science, 2016, Volume 352, Number 6285, Page 608
[15]
Wei Wei, Zequn Sun, Willian A da Silveira, Zhenning Yu, Andrew Lawson, Gary Hardiman, Linda E Kelemen, and Dongjun Chung
Statistical Methods in Medical Research, 2018, Page 096228021775298
[16]
Wei Wang, David Ackland, Jodie McClelland, Kate E. Webster, and Saman Halgamuge
IEEE Journal of Biomedical and Health Informatics, 2017, Page 1
[17]
Xiangnan Xu, Kian-Kai Cheng, Lingli Deng, and Jiyang Dong
Chemometrics and Intelligent Laboratory Systems, 2017
[18]
Helena Ursula Zacharias, Thorsten Rehberg, Sebastian Mehrl, Daniel Richtmann, Tilo Wettig, Peter J. Oefner, Rainer Spang, Wolfram Gronwald, and Michael Altenbuchinger
Journal of Proteome Research, 2017
[19]
Yanzhu Lin, Min Zhang, and Dabao Zhang
Computational Statistics & Data Analysis, 2015, Volume 88, Page 119
[20]
Caroline Truntzer, Elise Mostacci, Aline Jeannin, Jean-Michel Petit, Patrick Ducoroy, and Hervé Cardot
BMC Bioinformatics, 2014, Volume 15, Number 1
[21]
Nichola C. Garbett and Guy N. Brock
Biochimica et Biophysica Acta (BBA) - General Subjects, 2016, Volume 1860, Number 5, Page 981
[22]
Sheng Ren, Anna A. Hinzman, Emily L. Kang, Rhonda D. Szczesniak, and Long Jason Lu
Metabolomics, 2015, Volume 11, Number 6, Page 1492
[23]
Perushan Rajah, John Odindi, Elfatih Abdel-Rahman, and Onisimo Mutanga
South African Journal of Plant and Soil, 2017, Page 1
[24]
Kabir Peerbhay, Onisimo Mutanga, Romano Lottering, Victor Bangamwabo, and Riyad Ismail
ISPRS Journal of Photogrammetry and Remote Sensing, 2016, Volume 121, Page 167
[25]
Ronny Richter, Björn Reu, Christian Wirth, Daniel Doktor, and Michael Vohland
International Journal of Applied Earth Observation and Geoinformation, 2016, Volume 52, Page 464
[26]
Douglas Mahana, Chad M. Trent, Zachary D. Kurtz, Nicholas A. Bokulich, Thomas Battaglia, Jennifer Chung, Christian L. Müller, Huilin Li, Richard A. Bonneau, and Martin J. Blaser
Genome Medicine, 2016, Volume 8, Number 1
[27]
Jianling Liu, Tianli Pei, Jiexin Mu, Chunli Zheng, Xuetong Chen, Chao Huang, Yingxue Fu, Zongsuo Liang, and Yonghua Wang
Evidence-Based Complementary and Alternative Medicine, 2016, Volume 2016, Page 1
[28]
André Große-Stoltenberg, Christine Hellmann, Christiane Werner, Jens Oldeland, and Jan Thiele
Remote Sensing, 2016, Volume 8, Number 4, Page 334
[29]
Qiming Li, Yan Yan, and Hanzi Wang
IEEE Transactions on Intelligent Transportation Systems, 2016, Volume 17, Number 4, Page 1062
[30]
Seung-Hwan Bae and Kuk-Jin Yoon
IEEE Transactions on Medical Imaging, 2015, Volume 34, Number 11, Page 2379
[31]
Irene Hoffmann, Peter Filzmoser, Sven Serneels, and Kurt Varmuza
Journal of Chemometrics, 2016, Volume 30, Number 4, Page 153
[32]
Yingxiong Fu, Jiangtao Peng, and Xuemei Dong
Journal of Mathematical Chemistry, 2016, Volume 54, Number 2, Page 403
[33]
Yifeng Li and Alioune Ngom
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2013, Volume 10, Number 2, Page 447
[34]
Fabian E. Fassnacht, Carsten Neumann, Michael Forster, Henning Buddenbaum, Aniruddha Ghosh, Anne Clasen, Pawan Kumar Joshi, and Barbara Koch
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2014, Volume 7, Number 6, Page 2547
[35]
Ruixin Guo, Mihye Ahn, and Hongtu Zhu Hongtu Zhu
Journal of Computational and Graphical Statistics, 2015, Volume 24, Number 1, Page 274
[36]
Ching Siang Tan, Wai Soon Ting, Mohd Saberi Mohamad, Weng Howe Chan, Safaai Deris, and Zuraini Ali Shah
BioMed Research International, 2014, Volume 2014, Page 1
[37]
İbrahim Karaman, Natalja P. Nørskov, Christian Clement Yde, Mette Skou Hedemann, Knud Erik Bach Knudsen, and Achim Kohler
Metabolomics, 2015, Volume 11, Number 2, Page 367
[38]
C. Corbane, S. Alleaume, and M. Deshayes
International Journal of Remote Sensing, 2013, Volume 34, Number 21, Page 7625
[39]
İbrahim Karaman, El Mostafa Qannari, Harald Martens, Mette Skou Hedemann, Knud Erik Bach Knudsen, and Achim Kohler
Chemometrics and Intelligent Laboratory Systems, 2013, Volume 122, Page 65

Comments (0)

Please log in or register to comment.
Log in