Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido

IMPACT FACTOR 2018: 0.536
5-year IMPACT FACTOR: 0.764

CiteScore 2018: 0.49

SCImago Journal Rank (SJR) 2018: 0.316
Source Normalized Impact per Paper (SNIP) 2018: 0.342

Mathematical Citation Quotient (MCQ) 2017: 0.04

See all formats and pricing
More options …
Volume 3, Issue 1


Volume 10 (2011)

Volume 9 (2010)

Volume 6 (2007)

Volume 5 (2006)

Volume 4 (2005)

Volume 2 (2003)

Volume 1 (2002)

Classifying Gene Expression Profiles from Pairwise mRNA Comparisons

Donald Geman / Christian d'Avignon / Daniel Q. Naiman / Raimond L. Winslow
Published Online: 2004-08-30 | DOI: https://doi.org/10.2202/1544-6115.1071

We present a new approach to molecular classification based on mRNA comparisons. Our method, referred to as the top-scoring pair(s) (TSP) classifier, is motivated by current technical and practical limitations in using gene expression microarray data for class prediction, for example to detect disease, identify tumors or predict treatment response. Accurate statistical inference from such data is difficult due to the small number of observations, typically tens, relative to the large number of genes, typically thousands. Moreover, conventional methods from machine learning lead to decisions which are usually very difficult to interpret in simple or biologically meaningful terms. In contrast, the TSP classifier provides decision rules which i) involve very few genes and only relative expression values (e.g., comparing the mRNA counts within a single pair of genes); ii) are both accurate and transparent; and iii) provide specific hypotheses for follow-up studies. In particular, the TSP classifier achieves prediction rates with standard cancer data that are as high as those of previous studies which use considerably more genes and complex procedures. Finally, the TSP classifier is parameter-free, thus avoiding the type of over-fitting and inflated estimates of performance that result when all aspects of learning a predictor are not properly cross-validated.

Keywords: microarray data; class prediction; mRNA comparisons

About the article

Published Online: 2004-08-30

Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 3, Issue 1, Pages 1–19, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1071.

Export Citation

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.Get Permission

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

Sameer Salunkhe, Naren Chandran, Pratik Chandrani, Amit Dutt, and Shilpee Dutt
Briefings in Bioinformatics, 2018
Patrick Slama, Michael R. Hoopmann, Robert L. Moritz, and Donald Geman
Molecular Omics, 2018
Dominik Langgartner, Andrea M. Füchsl, Lisa M. Kaiser, Tatjana Meier, Sandra Foertsch, Christian Buske, Stefan O. Reber, Medhanie A. Mulaw, and Yvette Tache
PLOS ONE, 2018, Volume 13, Number 9, Page e0202471
Te-Yao Hsu, Jyun-Mu Lin, Mai-Huong T. Nguyen, Feng-Hsiang Chung, Ching-Chang Tsai, Hsin-Hsin Cheng, Yun-Ju Lai, Hsuan-Ning Hung, and Chien-Sheng Chen
Molecular & Cellular Proteomics, 2018, Volume 17, Number 8, Page 1457
Guini Hong, Hongdong Li, Mengyao Li, Weicheng Zheng, Jing Li, Meirong Chi, Jun Cheng, and Zheng Guo
Briefings in Bioinformatics, 2018, Volume 19, Number 4, Page 613
Dimitri Kagaris, Alireza Khamesipour, and Constantin T. Yiannoutsos
BMC Bioinformatics, 2018, Volume 19, Number 1
Yunlong Jiao and Jean-Philippe Vert
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, Volume 40, Number 7, Page 1755
Xin Huang, Xiaohui Lin, Lina Zhou, and Benzhe Su
Journal of Pharmaceutical and Biomedical Analysis, 2018
Wikum Dinalankara, Qian Ke, Yiran Xu, Lanlan Ji, Nicole Pagane, Anching Lien, Tejasvi Matam, Elana J. Fertig, Nathan D. Price, Laurent Younes, Luigi Marchionni, and Donald Geman
Proceedings of the National Academy of Sciences, 2018, Page 201721628
Sarah E. Dickinson, Brock A. Griffin, Michelle F. Elmore, Lisa Kriese-Anderson, Joshua B. Elmore, Paul W. Dyce, Soren P. Rodning, and Fernando H. Biase
BMC Genomics, 2018, Volume 19, Number 1
Jochen Hochrein, Matthias S. Klein, Helena U. Zacharias, Juan Li, Gene Wijffels, Horst Joachim Schirra, Rainer Spang, Peter J. Oefner, and Wolfram Gronwald
Journal of Proteome Research, 2012, Volume 11, Number 12, Page 6242
Huaping Liu, Yawei Li, Jun He, Qingzhou Guan, Rou Chen, Haidan Yan, Weicheng Zheng, Kai Song, Hao Cai, You Guo, Xianlong Wang, and Zheng Guo
BMC Genomics, 2017, Volume 18, Number 1
Pengwei Xing, Yuan Chen, Jun Gao, Lianyang Bai, and Zheming Yuan
Scientific Reports, 2017, Volume 7, Number 1
Rex Shen, Lan Luo, and Hui Jiang
BMC Bioinformatics, 2017, Volume 18, Number 1
Xin Huang, Xiaohui Lin, Jun Zeng, Lichao Wang, Peiyuan Yin, Lina Zhou, Chunxiu Hu, and Weihong Yao
Scientific Reports, 2017, Volume 7, Number 1
Sunanda Das and Asit Kumar Das
International Journal of Rough Sets and Data Analysis, 2018, Volume 5, Number 1, Page 1
Benjamin Ulfenborg, Karin Klinga-Levan, and Björn Olsson
Cancer Informatics, 2013, Volume 12, Page CIN.S10356
Endre Sebestyén, Michał Zawisza, and Eduardo Eyras
Nucleic Acids Research, 2015, Volume 43, Number 3, Page 1345
Xiaosheng Wang and Osamu Gotoh
Cancer Informatics, 2009, Volume 7, Page CIN.S2655
Xiaosheng Wang and Osamu Gotoh
Cancer Informatics, 2010, Volume 9, Page CIN.S3794
Richard Simon, Amy Lam, Ming-Chung Li, Michael Ngan, Supriya Menenzes, and Yingdong Zhao
Cancer Informatics, 2007, Volume 3, Page 117693510700300
Tianzhou Ma, Chi Song, and George C. Tseng
Statistical Modelling: An International Journal, 2017, Volume 17, Number 4-5, Page 305
Thomas J. Fuchs and Joachim M. Buhmann
Computerized Medical Imaging and Graphics, 2011, Volume 35, Number 7-8, Page 515
Hsi-Che Liu, Chien-Yu Chen, Yu-Ting Liu, Cheng-Bang Chu, Der-Cherng Liang, Lee-Yung Shih, and Chih-Jen Lin
Journal of Biomedical Informatics, 2008, Volume 41, Number 4, Page 570
Xiaosheng Wang
Genomics, 2012, Volume 99, Number 2, Page 90
David Weisman, Hong Liu, Jessica Redfern, Liya Zhu, and Adán Colón-Carmona
Environmental Science & Technology, 2011, Volume 45, Number 12, Page 5132
Nathan D. Price, Greg Foltz, Anup Madan, Leroy Hood, and Qiang Tian
Journal of Cellular and Molecular Medicine, 2007, Volume 12, Number 1, Page 97
Mario R. Guarracino, Altannar Chinchuluun, and Panos M. Pardalos
Optimization Letters, 2009, Volume 3, Number 3, Page 357
Hongdong Li, Guini Hong, Hui Xu, and Zheng Guo
Gene, 2015, Volume 555, Number 2, Page 203
Jeffrey L. Ebersole, Dolph Dawson, Pinar Emecen-Huja, Radhakrishnan Nagarajan, Katherine Howard, Martha E. Grady, Katherine Thompson, Rebecca Peyyala, Ahmad Al-Attar, Kathryn Lethbridge, Sreenatha Kirakodu, and Octavio A. Gonzalez
Periodontology 2000, 2017, Volume 75, Number 1, Page 52
Komal S. Rathi, Daria A. Gaykalova, Patrick Hennessey, Joseph A. Califano, and Michael F. Ochs
Drug Development Research, 2014, Volume 75, Number 6, Page 343
Troy J. Anderson, Irina Tchernyshyov, Roberto Diez, Robert N. Cole, Donald Geman, Chi V. Dang, and Raimond L. Winslow
PROTEOMICS, 2007, Volume 7, Number 8, Page 1197
Wanwei Zhang, Tao Zeng, and Luonan Chen
Journal of Theoretical Biology, 2014, Volume 362, Page 35
Claudio Isella, Francesco Brundu, Sara E. Bellomo, Francesco Galimi, Eugenia Zanella, Roberta Porporato, Consalvo Petti, Alessandro Fiori, Francesca Orzan, Rebecca Senetta, Carla Boccaccio, Elisa Ficarra, Luigi Marchionni, Livio Trusolino, Enzo Medico, and Andrea Bertotti
Nature Communications, 2017, Volume 8, Page 15107
Margaret Calciano, Jean Christophe Lemarié, Elodie Blondiaux, Richard Einstein, and Pascale Fehlbaum-Beurdeley
Biomarkers, 2013, Volume 18, Number 3, Page 264
Blaise Hanczar, Jean-Daniel Zucker, Corneliu Henegar, and Lorenza Saitta
Bioinformatics, 2007, Volume 23, Number 21, Page 2866
Bahman Afsari, Elana J. Fertig, Donald Geman, and Luigi Marchionni
Bioinformatics, 2015, Volume 31, Number 2, Page 273
Hongwei Wang, Qiang Sun, Wenyuan Zhao, Lishuang Qi, Yunyan Gu, Pengfei Li, Mengmeng Zhang, Yang Li, Shu-Lin Liu, and Zheng Guo
Bioinformatics, 2015, Volume 31, Number 1, Page 62
Andrew T. Magis, John C. Earls, Youn-Hee Ko, James A. Eddy, and Nathan D. Price
Bioinformatics, 2011, Volume 27, Number 6, Page 872
C. Nichita, L. Ciarloni, S. Monnier-Benoit, S. Hosseinian, G. Dorta, and C. Rüegg
Alimentary Pharmacology & Therapeutics, 2014, Volume 39, Number 5, Page 507
Armando Fernandes, Susana Vinga, and Collin M. Stultz
PLOS ONE, 2016, Volume 11, Number 3, Page e0150369
J. T. Poirier, Irina Dobromilskaya, Whei F. Moriarty, Craig D. Peacock, Christine L. Hann, and Charles M. Rudin
JNCI: Journal of the National Cancer Institute, 2013, Volume 105, Number 14, Page 1059
Fabien Crauste, Julien Mafille, Lilia Boucinha, Sophia Djebali, Olivier Gandrillon, Jacqueline Marvel, and Christophe Arpin
Cell Systems, 2017, Volume 4, Number 3, Page 306
Tessa E. Pronk, Eugene P. van Someren, Rob H. Stierum, Janine Ezendam, and Jeroen L.A. Pennings
Journal of Applied Toxicology, 2013, Volume 33, Number 12, Page 1407
George R. Saade, Kim A. Boggess, Scott A. Sullivan, Glenn R. Markenson, Jay D. Iams, Dean V. Coonrod, Leonardo M. Pereira, M. Sean Esplin, Larry M. Cousins, Garrett K. Lam, Matthew K. Hoffman, Robert D. Severinsen, Trina Pugmire, Jeff S. Flick, Angela C. Fox, Amir J. Lueth, Sharon R. Rust, Emanuele Mazzola, ChienTing Hsu, Max T. Dufford, Chad L. Bradford, Ilia E. Ichetovkin, Tracey C. Fleischer, Ashoka D. Polpitiya, Gregory C. Critchfield, Paul E. Kearney, J. Jay Boniface, and Durlin E. Hickok
American Journal of Obstetrics and Gynecology, 2016, Volume 214, Number 5, Page 633.e1
Cynthia J. Carlyn, Nancy J. Andersen, Aldona L. Baltch, Raymond Smith, Andrew A. Reilly, and David A. Lawrence
Diagnostic Microbiology and Infectious Disease, 2015, Volume 83, Number 3, Page 312
Hyunjin Kim, Jaegyoon Ahn, Chihyun Park, Youngmi Yoon, and Sanghyun Park
Computers in Biology and Medicine, 2013, Volume 43, Number 10, Page 1363
James A. Eddy, Jaeyun Sung, Donald Geman, and Nathan D. Price
Technology in Cancer Research & Treatment, 2010, Volume 9, Number 2, Page 149
Shuyi Ma, Jaeyun Sung, Andrew T. Magis, Yuliang Wang, Donald Geman, Nathan D. Price, and Chuhsing Kate Hsiao
PLoS ONE, 2014, Volume 9, Number 10, Page e110840
James A. Eddy, Leroy Hood, Nathan D. Price, Donald Geman, and Doheon Lee
PLoS Computational Biology, 2010, Volume 6, Number 5, Page e1000792
Tyler A. Herek, Timothy D. Shew, Heather N. Spurgin, Christine E. Cutucache, and Ivan R Nabi
PLOS ONE, 2015, Volume 10, Number 11, Page e0142682
Mehrab Ghanat Bari, Sirajul Salekin, and Jianqiu Michelle Zhang
Molecular Informatics, 2017, Volume 36, Number 4, Page 1600099
Nicolas Ugolin, Catherine Ory, Emilie Lefevre, Nora Benhabiles, Paul Hofman, Martin Schlumberger, Sylvie Chevillard, and Ying Xu
PLoS ONE, 2011, Volume 6, Number 8, Page e23581
Youssef M. Youssef, Nicole M.A. White, Jörg Grigull, Adriana Krizova, Christina Samy, Salvador Mejia-Guerrero, Andrew Evans, and George M. Yousef
European Urology, 2011, Volume 59, Number 5, Page 721
Lishuang Qi, Yang Li, Yuan Qin, Gengen Shi, Tianhao Li, Jiasheng Wang, Libin Chen, Yunyan Gu, Wenyuan Zhao, and Zheng Guo
British Journal of Cancer, 2016, Volume 115, Number 12, Page 1513
Merja Heinäniemi, Matti Nykter, Roger Kramer, Anke Wienecke-Baldacchino, Lasse Sinkkonen, Joseph Xu Zhou, Richard Kreisberg, Stuart A Kauffman, Sui Huang, and Ilya Shmulevich
Nature Methods, 2013, Volume 10, Number 6, Page 577
Li Wang, Yixuan Gong, Uma Chippada-Venkata, Matthias Michael Heck, Margitta Retz, Roman Nawroth, Matthew Galsky, Che-Kai Tsao, Eric Schadt, Johann de Bono, David Olmos, Jun Zhu, and William K. Oh
BMC Medicine, 2015, Volume 13, Number 1
Antonio Scialdone, Kedar N. Natarajan, Luis R. Saraiva, Valentina Proserpio, Sarah A. Teichmann, Oliver Stegle, John C. Marioni, and Florian Buettner
Methods, 2015, Volume 85, Page 54
Guini Hong, Hongdong Li, Jiahui Zhang, Qingzhou Guan, Rou Chen, and Zheng Guo
Scientific Reports, 2017, Volume 7, Number 1
Ashis Saha, Minji Jeon, Aik Choon Tan, Jaewoo Kang, and Chiara Romualdi
PLOS ONE, 2015, Volume 10, Number 7, Page e0131656
Jaeyun Sung, Pan-Jun Kim, Shuyi Ma, Cory C. Funk, Andrew T. Magis, Yuliang Wang, Leroy Hood, Donald Geman, Nathan D. Price, and Isidore Rigoutsos
PLoS Computational Biology, 2013, Volume 9, Number 7, Page e1003148
Pankaj Chopra, Jinseung Lee, Jaewoo Kang, Sunwon Lee, and Joel S. Bader
PLoS ONE, 2010, Volume 5, Number 12, Page e14305
Ashis Saha, Aik Choon Tan, Jaewoo Kang, and Patrick Aloy
PLoS ONE, 2014, Volume 9, Number 1, Page e84227
Roman Hornung, David Causeur, Christoph Bernau, and Anne-Laure Boulesteix
Bioinformatics, 2016, Page btw650
SungHwan Kim, Chien-Wei Lin, and George. C. Tseng
Bioinformatics, 2016, Volume 32, Number 13, Page 1966
Jia Lv, Qinke Peng, Xiao Chen, and Zhi Sun
Expert Systems with Applications, 2016, Volume 59, Page 13
Yuan Chen, Lifeng Wang, Lanzhi Li, Hongyan Zhang, and Zheming Yuan
BMC Bioinformatics, 2016, Volume 17, Number 1
C. Lazar, J. Taminau, S. Meganck, D. Steenhoff, A. Coletta, C. Molter, V. de Schaetzen, R. Duque, H. Bersini, and A. Nowe
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, Volume 9, Number 4, Page 1106
Michael F. Ochs, Jason E. Farrar, Michael Considine, Yingying Wei, Soheil Meshinchi, and Robert J. Arceci
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2014, Volume 11, Number 3, Page 520
Hongyan Zhang, Lanzhi Li, Chao Luo, Congwei Sun, Yuan Chen, Zhijun Dai, and Zheming Yuan
BioMed Research International, 2014, Volume 2014, Page 1
Lin Zhang, Chunxiang Hao, Xiaopei Shen, Guini Hong, Hongdong Li, Xianxiao Zhou, ChunYang Liu, and Zheng Guo
Breast Cancer Research and Treatment, 2013, Volume 139, Number 2, Page 361
Ahmed Hossain, Andrew R. Willan, and Joseph Beyene
Communications in Statistics - Simulation and Computation, 2013, Volume 42, Number 7, Page 1563
R. L. Winslow, N. Trayanova, D. Geman, and M. I. Miller
Science Translational Medicine, 2012, Volume 4, Number 158, Page 158rv11
Sandra Ramos, Antónia Amaral Turkman, and Marília Antunes
Computational Statistics & Data Analysis, 2010, Volume 54, Number 8, Page 2012
Alok Sharma and Kuldip K. Paliwal
Data & Knowledge Engineering, 2008, Volume 66, Number 2, Page 338
Richard J. Oentaryo, Michel Pasquier, and Chai Quek
Expert Systems with Applications, 2011, Volume 38, Number 10, Page 12066
Gilles Guillot, Maja Olsson, Mikael Benson, and Mats Rudemo
Mathematical Biosciences, 2007, Volume 205, Number 2, Page 195
Jin-Mao Wei, Shu-Qin Wang, and Xiao-Jie Yuan
IEEE Transactions on Knowledge and Data Engineering, 2010, Volume 22, Number 3, Page 381
Sébastien Gadat
SIAM Journal on Control and Optimization, 2008, Volume 47, Number 2, Page 904

Comments (0)

Please log in or register to comment.
Log in