Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido


Volume 3, Issue 1

Classifying Gene Expression Profiles from Pairwise mRNA Comparisons

Donald Geman / Christian d'Avignon / Daniel Q. Naiman / Raimond L. Winslow
Published Online: 2004-08-30 | DOI: https://doi.org/10.2202/1544-6115.1071

We present a new approach to molecular classification based on mRNA comparisons. Our method, referred to as the top-scoring pair(s) (TSP) classifier, is motivated by current technical and practical limitations in using gene expression microarray data for class prediction, for example to detect disease, identify tumors or predict treatment response. Accurate statistical inference from such data is difficult due to the small number of observations, typically tens, relative to the large number of genes, typically thousands. Moreover, conventional methods from machine learning lead to decisions which are usually very difficult to interpret in simple or biologically meaningful terms. In contrast, the TSP classifier provides decision rules which i) involve very few genes and only relative expression values (e.g., comparing the mRNA counts within a single pair of genes); ii) are both accurate and transparent; and iii) provide specific hypotheses for follow-up studies. In particular, the TSP classifier achieves prediction rates with standard cancer data that are as high as those of previous studies which use considerably more genes and complex procedures. Finally, the TSP classifier is parameter-free, thus avoiding the type of over-fitting and inflated estimates of performance that result when all aspects of learning a predictor are not properly cross-validated.

Keywords: microarray data; class prediction; mRNA comparisons
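
The pairwise decision rule described in the abstract can be illustrated with a short sketch. The abstract does not spell out how pairs are scored, so the code below assumes the score as the TSP method is commonly described: for each gene pair (i, j), the absolute difference between the two classes in the frequency of the ordering X_i < X_j. The function names `fit_tsp` and `predict_tsp`, the 0/1 class labels, and the toy data are hypothetical and serve only as illustration; this is not the authors' implementation.

```python
import numpy as np


def fit_tsp(X, y):
    """Pick the single gene pair whose ordering differs most between classes.

    X: array of shape (n_samples, n_genes) with expression values.
    y: array of binary class labels (0 or 1), one per sample.
    Assumed score: |P(X_i < X_j | class 0) - P(X_i < X_j | class 1)|.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    # less[s, i, j] is True when gene i is expressed below gene j in sample s.
    # This uses n_samples * n_genes^2 booleans, so it only suits small examples.
    less = X[:, :, None] < X[:, None, :]
    p0 = less[y == 0].mean(axis=0)  # ordering frequency within class 0
    p1 = less[y == 1].mean(axis=0)  # ordering frequency within class 1
    score = np.abs(p0 - p1)
    i, j = np.unravel_index(np.argmax(score), score.shape)
    # The class in which "gene i below gene j" is more frequent is the
    # class predicted whenever that ordering is observed.
    label_if_less = 0 if p0[i, j] > p1[i, j] else 1
    return int(i), int(j), label_if_less, float(score[i, j])


def predict_tsp(model, X):
    """Classify samples using only the relative order of the chosen pair."""
    i, j, label_if_less, _ = model
    X = np.asarray(X, dtype=float)
    return np.where(X[:, i] < X[:, j], label_if_less, 1 - label_if_less)


# Toy data (made up): gene 0 is below gene 1 in class 0, above it in class 1.
X_train = [[1, 5, 3], [2, 6, 1], [1, 4, 2],
           [7, 2, 3], [8, 1, 2], [6, 3, 1]]
y_train = [0, 0, 0, 1, 1, 1]

model = fit_tsp(X_train, y_train)
print(model)
print(predict_tsp(model, [[0.5, 2.0, 9.0], [3.0, 1.0, 0.0]]))
```

Because the rule depends only on which of two genes has the higher value within a sample, any monotone rescaling of a sample's measurements leaves the prediction unchanged, which is one reason a rule of this kind is easy to interpret.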

Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 3, Issue 1, Pages 1–19, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1071.

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.
