IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

Junjie Chen; Xiaolong Wang; Bin Liu

doi:10.1038/srep19062

IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

Junjie Chen, Xiaolong Wang, Bin Liu^*

^*此作品的通讯作者

Harbin Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

53 引用（Scopus）

摘要

The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

源语言	英语
文章编号	19062
期刊	Scientific Reports
卷	6
DOI	https://doi.org/10.1038/srep19062
出版状态	已出版 - 12 1月 2016
已对外发布	是

访问文件

10.1038/srep19062

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{babd99d097ac42e38b53263f124a1573,

title = "IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions",

abstract = "The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.",

author = "Junjie Chen and Xiaolong Wang and Bin Liu",

year = "2016",

month = jan,

day = "12",

doi = "10.1038/srep19062",

language = "English",

volume = "6",

journal = "Scientific Reports",

issn = "2045-2322",

publisher = "Nature Publishing Group",

}

TY - JOUR

T1 - IMiRNA-SSF

T2 - Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

AU - Chen, Junjie

AU - Wang, Xiaolong

AU - Liu, Bin

PY - 2016/1/12

Y1 - 2016/1/12

N2 - The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

AB - The identification of microRNA precursors (pre-miRNAs) helps in understanding regulator in biological processes. The performance of computational predictors depends on their training sets, in which the negative sets play an important role. In this regard, we investigated the influence of benchmark datasets on the predictive performance of computational predictors in the field of miRNA identification, and found that the negative samples have significant impact on the predictive results of various methods. We constructed a new benchmark set with different data distributions of negative samples. Trained with this high quality benchmark dataset, a new computational predictor called iMiRNA-SSF was proposed, which employed various features extracted from RNA sequences. Experimental results showed that iMiRNA-SSF outperforms three state-of-the-art computational methods. For practical applications, a web-server of iMiRNA-SSF was established at the website http://bioinformatics.hitsz.edu.cn/iMiRNA-SSF/.

UR - http://www.scopus.com/inward/record.url?scp=84954423740&partnerID=8YFLogxK

U2 - 10.1038/srep19062

DO - 10.1038/srep19062

M3 - Article

C2 - 26753561

AN - SCOPUS:84954423740

SN - 2045-2322

VL - 6

JO - Scientific Reports

JF - Scientific Reports

M1 - 19062

ER -

IMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions

摘要

访问文件

其它文件与链接

指纹

引用此