IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

Junjie Chen; Xiaolong Wang; Bin Liu

doi:10.1038/srep19062

IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

Junjie Chen, Xiaolong Wang, Bin Liu^*

^*Corresponding author for this work

Harbin Institute of Technology

Research output: Contribution to journal › Article › peer-review

54 Citations (Scopus)

Abstract

The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

Original language	English
Article number	19062
Journal	Scientific Reports
Volume	6
DOIs	https://doi.org/10.1038/srep19062
Publication status	Published - 12 Jan 2016
Externally published	Yes

Access to Document

10.1038/srep19062

Cite this

Chen, J., Wang, X., & Liu, B. (2016). IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions. Scientific Reports, 6, Article 19062. https://doi.org/10.1038/srep19062

@article{babd99d097ac42e38b53263f124a1573,

title = "IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions",

abstract = "The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.",

author = "Junjie Chen and Xiaolong Wang and Bin Liu",

year = "2016",

month = jan,

day = "12",

doi = "10.1038/srep19062",

language = "English",

volume = "6",

journal = "Scientific Reports",

issn = "2045-2322",

publisher = "Nature Publishing Group",

}

TY - JOUR

T1 - IMiRNA-SSF

T2 - Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

AU - Chen, Junjie

AU - Wang, Xiaolong

AU - Liu, Bin

PY - 2016/1/12

Y1 - 2016/1/12

N2 - The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

AB - The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

UR - http://www.scopus.com/inward/record.url?scp=84954423740&partnerID=8YFLogxK

U2 - 10.1038/srep19062

DO - 10.1038/srep19062

M3 - Article

C2 - 26753561

AN - SCOPUS:84954423740

SN - 2045-2322

VL - 6

JO - Scientific Reports

JF - Scientific Reports

M1 - 19062

ER -

IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

Abstract

Access to Document

Other files and links

Fingerprint

Cite this