Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido

Volume 11, Issue 2



Querying Genomic Databases: Refining the Connectivity Map

Mark R. Segal / Hao Xiong / Henrik Bengtsson / Richard Bourgon / Robert Gentleman
Published Online: 2012-01-06 | DOI: https://doi.org/10.2202/1544-6115.1715

The advent of high-throughput biotechnologies, which can efficiently measure gene expression on a global basis, has led to the creation and population of correspondingly rich databases and compendia. Such repositories have the potential to add enormous scientific value beyond that provided by individual studies which, due largely to cost considerations, are typified by small sample sizes. Accordingly, substantial effort has been invested in devising analysis schemes for utilizing gene-expression repositories. Here, we focus on one such scheme, the Connectivity Map (cmap), that was developed with the express purpose of identifying drugs with putative efficacy against a given disease, where the disease in question is characterized by a (differential) gene-expression signature. Initial claims surrounding cmap intimated that such tools might lead to new, previously unanticipated applications of existing drugs. However, further application suggests that its primary utility is in connecting a disease condition whose biology is largely unknown to a drug whose mechanisms of action are well understood, making cmap a tool for enhancing biological knowledge.

The success of the Connectivity Map is belied by its simplicity. The aforementioned signature serves as an unordered query which is applied to a customized database of (differential) gene-expression experiments designed to elicit response to a wide range of drugs, across a spectrum of concentrations, durations, and cell lines. Such application is effected by computing a per-experiment score that measures "closeness" between the signature and the experiment. Top-scoring experiments, and the attendant drug(s), are then deemed relevant to the disease underlying the query. Inference supporting such elicitations is pursued via re-sampling.

In this paper, we revisit two key aspects of the Connectivity Map implementation. Firstly, we develop new approaches to measuring closeness for the common scenario wherein the query constitutes an ordered list. These use metrics proposed for analyzing partially ranked data, which are of interest in their own right and not widely used. Secondly, we advance an alternate inferential approach based on generating empirical null distributions that exploit the scope, and capture dependencies, embodied by the database. Using these refinements we undertake a comprehensive re-evaluation of Connectivity Map findings which, in general terms, reveals that accommodating ordered queries is less critical than the mode of inference.
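To make the query-and-score workflow described above concrete, the following is a minimal sketch in Python. It is not the scoring statistic used by cmap, nor the partial-ranking metrics or empirical null developed in the paper. It assumes a toy closeness score (mean rank of the query's "up" genes minus mean rank of its "down" genes within each experiment) and a generic re-sampling null built from random gene sets of the same size. The function names (`rank_profiles`, `closeness`, `resampled_p_values`), the sign convention, and the synthetic data are all hypothetical.

```python
import numpy as np

def rank_profiles(diff_expr):
    """Rank genes within each experiment (row) and scale ranks to [0, 1].

    0.0 is the most down-regulated gene, 1.0 the most up-regulated.
    Assumes continuous values, so ties are not handled specially.
    """
    n_genes = diff_expr.shape[1]
    order = diff_expr.argsort(axis=1).argsort(axis=1)  # 0 = smallest value
    return order / (n_genes - 1)

def closeness(ranks, up_idx, down_idx):
    """Toy per-experiment score (an assumption, not the cmap statistic).

    Positive: the experiment resembles the signature.
    Negative: the experiment appears to reverse it.
    """
    return ranks[:, up_idx].mean(axis=1) - ranks[:, down_idx].mean(axis=1)

def resampled_p_values(ranks, up_idx, down_idx, n_resamples=1000, seed=0):
    """Two-sided p-values from random query gene sets of matching size."""
    rng = np.random.default_rng(seed)
    observed = closeness(ranks, up_idx, down_idx)
    n_genes = ranks.shape[1]
    k_up, k_down = len(up_idx), len(down_idx)
    exceed = np.zeros_like(observed)
    for _ in range(n_resamples):
        picked = rng.choice(n_genes, size=k_up + k_down, replace=False)
        null = closeness(ranks, picked[:k_up], picked[k_up:])
        exceed += np.abs(null) >= np.abs(observed)
    # Add-one correction keeps every p-value strictly positive.
    return observed, (exceed + 1) / (n_resamples + 1)

# Synthetic database: 20 experiments x 200 genes of pure noise, except that
# experiment 3 is constructed to reverse the hypothetical signature.
data_rng = np.random.default_rng(1)
diff_expr = data_rng.normal(size=(20, 200))
up_idx = np.arange(0, 10)
down_idx = np.arange(10, 20)
diff_expr[3, up_idx] -= 3.0
diff_expr[3, down_idx] += 3.0

ranks = rank_profiles(diff_expr)
scores, p_values = resampled_p_values(ranks, up_idx, down_idx)
best = int(np.argmin(scores))  # most negative score under this convention
print(best, scores[best], p_values[best])
```

Because the same random gene sets are scored against every experiment, the null here reflects each experiment's own rank profile, but it does not model dependencies between experiments; capturing those is the role of the database-derived empirical null that the abstract describes.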

Keywords: gene expression; symmetric group; partial ranking; empirical null

About the article

Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 11, Issue 2, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1715.

©2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.
