An ensemble learning framework for potential miRNA-disease association prediction with positive-unlabeled data

Yao Wu, Donghua Zhu, Xuefeng Wang*, Shuo Zhang

*此作品的通讯作者

    科研成果: 期刊稿件文章同行评审

    8 引用 (Scopus)

    摘要

    To explore the pathogenic mechanisms of MicroRNA (miRNA) on diverse diseases, many researchers have concentrated on discovering the potential associations between miRNA and disease using machine learning methods. However, the prediction accuracy of supervised machine learning methods is limited by lacking of experimentally-validated uncorrelated miRNA-disease pairs. Without these negative samples, training a highly accurate model is much more difficult. Different from traditional miRNA-disease prediction models using randomly selected unknown samples as negative training samples, we propose an ensemble learning framework to solve this positive-unlabeled (PU) learning problem. The framework incorporates two steps, i.e., a novel semi-supervised Kmeans (SS-Kmeans) to extract reliable negative samples from unknown miRNA-disease pairs and subagging method to generate diverse training sample sets to make full use of those reliable negative samples for ensemble learning. Combined with effective random vector functional link (RVFL) network as prediction model, the proposed framework showed superior prediction accuracy comparing with other popular approaches. A case study on lung and gastric neoplasms further confirms the framework's efficacy at identifying miRNA disease associations.

    源语言英语
    文章编号107566
    期刊Computational Biology and Chemistry
    95
    DOI
    出版状态已出版 - 12月 2021

    指纹

    探究 'An ensemble learning framework for potential miRNA-disease association prediction with positive-unlabeled data' 的科研主题。它们共同构成独一无二的指纹。

    引用此