TY - JOUR
T1 - iPiDA-sHN
T2 - Identification of Piwi-interacting RNA-disease associations by selecting high quality negative samples
AU - Wei, Hang
AU - Ding, Yuxin
AU - Liu, Bin
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2020/10
Y1 - 2020/10
N2 - As a large group of small non-coding RNAs (ncRNAs), Piwi-interacting RNAs (piRNAs) have been detected to be associated with various diseases. Identifying disease associated piRNAs can provide promising candidate molecular targets to promote the drug design. Although, a few computational ensemble methods have been developed for identifying piRNA-disease associations, the low-quality negative associations even with positive associations used during the training process prevent the predictive performance improvement. In this study, we proposed a new computational predictor named iPiDA-sHN to predict potential piRNA-disease associations. iPiDA-sHN presented the piRNA-disease pairs by incorporating piRNA sequence information, the known piRNA-disease association network, and the disease semantic graph. High-level features of piRNA-disease associations were extracted by the Convolutional Neural Network (CNN). Two-step positive-unlabeled learning strategy based on Support Vector Machine (SVM) was employed to select the high quality negative samples from the unknown piRNA-disease pairs. Finally, the SVM predictor trained with the known piRNA-disease associations and the high quality negative associations was used to predict new piRNA-disease associations. The experimental results showed that iPiDA-sHN achieved superior predictive ability compared with other state-of-the-art predictors.
AB - As a large group of small non-coding RNAs (ncRNAs), Piwi-interacting RNAs (piRNAs) have been detected to be associated with various diseases. Identifying disease associated piRNAs can provide promising candidate molecular targets to promote the drug design. Although, a few computational ensemble methods have been developed for identifying piRNA-disease associations, the low-quality negative associations even with positive associations used during the training process prevent the predictive performance improvement. In this study, we proposed a new computational predictor named iPiDA-sHN to predict potential piRNA-disease associations. iPiDA-sHN presented the piRNA-disease pairs by incorporating piRNA sequence information, the known piRNA-disease association network, and the disease semantic graph. High-level features of piRNA-disease associations were extracted by the Convolutional Neural Network (CNN). Two-step positive-unlabeled learning strategy based on Support Vector Machine (SVM) was employed to select the high quality negative samples from the unknown piRNA-disease pairs. Finally, the SVM predictor trained with the known piRNA-disease associations and the high quality negative associations was used to predict new piRNA-disease associations. The experimental results showed that iPiDA-sHN achieved superior predictive ability compared with other state-of-the-art predictors.
KW - Convolutional neural network
KW - High quality negative sample
KW - Positive-unlabeled learning
KW - piRNA-disease associations
UR - http://www.scopus.com/inward/record.url?scp=85090336101&partnerID=8YFLogxK
U2 - 10.1016/j.compbiolchem.2020.107361
DO - 10.1016/j.compbiolchem.2020.107361
M3 - Article
C2 - 32916452
AN - SCOPUS:85090336101
SN - 1476-9271
VL - 88
JO - Computational Biology and Chemistry
JF - Computational Biology and Chemistry
M1 - 107361
ER -