Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Sanguinetti, Guido

IMPACT FACTOR 2018: 0.536
5-year IMPACT FACTOR: 0.764

CiteScore 2018: 0.49

SCImago Journal Rank (SJR) 2018: 0.316
Source Normalized Impact per Paper (SNIP) 2018: 0.342

Mathematical Citation Quotient (MCQ) 2018: 0.02

See all formats and pricing
More options …
Volume 11, Issue 1


Volume 10 (2011)

Volume 9 (2010)

Volume 6 (2007)

Volume 5 (2006)

Volume 4 (2005)

Volume 2 (2003)

Volume 1 (2002)

MicroRNA Transcription Start Site Prediction with Multi-objective Feature Selection

Malay Bhattacharyya / Lars Feuerbach / Tapas Bhadra / Thomas Lengauer / Sanghamitra Bandyopadhyay
Published Online: 2012-01-06 | DOI: https://doi.org/10.2202/1544-6115.1743

MicroRNAs (miRNAs) are non-coding, short (21-23nt) regulators of protein-coding genes that are generally transcribed first into primary miRNA (pri-miR), followed by the generation of precursor miRNA (pre-miR). This finally leads to the production of the mature miRNA. A large amount of information is available on the pre- and mature miRNAs. However, very little is known about the pri-miRs, due to a lack of knowledge about their transcription start sites (TSSs). Based on the genomic loci, miRNAs can be categorized into two types —intragenic (intra-miR) and intergenic (inter-miR). While it is already an established fact that intra-miRs are commonly transcribed in conjunction with their host genes, the transcription machinery of inter-miRs is poorly understood. Although it is assumed that miRNA promoters are similar in structure to gene promoters, since both are transcribed by RNA polymerase II (Pol II), computational validations exhibit poor performance of gene promoter prediction methods on miRNAs. In this paper, we concentrate on the problem of TSS prediction for miRNAs. The present study begins with the identification of positive and negative promoter samples from recently published data stemming from RNA-sequencing studies. From these samples of experimentally validated miRNA TSSs, a number of standard sequence features are extracted. Furthermore, to account for potential footprints related to promoter regulation by CpG dinucleotide targeted DNA methylation, a number of novel features are defined. We develop a support vector machine (SVM) with RBF kernel for the prediction of miRNA TSSs trained on human miRNA promoters. A novel feature reduction technique based on archived multi-objective simulated annealing (AMOSA) identifies the final set of features. The resulting model trained on miRNA promoters shows improved performance over the one trained on protein-coding gene promoters in terms of classification accuracy, sensitivity and specificity. Results are also reported for a completely independent biologically validated test set. In a part of the investigation, the proposed approach is used to predict protein-coding gene TSSs. It shows a significantly improved performance when compared to previously published gene TSS prediction methods.

Keywords: transcription start site; feature selection; classification; multi-objective optimization

About the article

Published Online: 2012-01-06

Citation Information: Statistical Applications in Genetics and Molecular Biology, Volume 11, Issue 1, Pages 1–25, ISSN (Online) 1544-6115, DOI: https://doi.org/10.2202/1544-6115.1743.

Export Citation

©2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston.Get Permission

Citing Articles

Here you can find all Crossref-listed publications in which this article is cited. If you would like to receive automatic email messages as soon as this article is cited in other publications, simply activate the “Citation Alert” on the top of this page.

Yunhe Wang, Bo Liu, Zhiqiang Ma, Ka-Chun Wong, and Xiangtao Li
IEEE Journal of Translational Engineering in Health and Medicine, 2019, Volume 7, Page 1
Malay Bhattacharyya, Manali Das, and Sanghamitra Bandyopadhyay
Genomics, Proteomics & Bioinformatics, 2012, Volume 10, Number 5, Page 310
Scott M. Hammond
Advanced Drug Delivery Reviews, 2015, Volume 87, Page 3
Tapas Bhadra, Malay Bhattacharyya, Lars Feuerbach, Thomas Lengauer, Sanghamitra Bandyopadhyay, and Walter Lukiw
PLoS ONE, 2013, Volume 8, Number 6, Page e66722
Xiangtao Li and Minghao Yin
IEEE Transactions on NanoBioscience, 2013, Volume 12, Number 4, Page 343

Comments (0)

Please log in or register to comment.
Log in