iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network

Jun Zhang, Qingcai Chen, Bin Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

55 Citations (Scopus)

Abstract

DNA-binding protein (DBP) and RNA-binding protein (RBP) are playing crucial roles in gene expression. Accurate identification of them is of great significance, and accurately computational predictors are highly required. In previous studies, DBP recognition and RBP recognition were treated as two separate tasks. Because the functional and structural similarities between DBPs and RBPs are high, the DBP predictors tend to predict RBPs as DBPs, while the RBP predictors tend to predict the DBPs as the RBPs, leading to high cross-prediction rate and low prediction precision. Here we introduced a multi-label learning model based on the motif-based convolutional neural network, and a sequence-based computational method called iDRBP_MMC was proposed to solve the cross-prediction problem so as to improve the predictive performance of DBPs and RBPs. The results on four test datasets showed that it outperformed other state-of-the-art DBP predictors and RBP predictors. When applied to analyze the tomato genome, the results reveal the ability of iDRBP_MMC for large-scale data analysis. Moreover, iDRBP_MMC can identify the proteins binding to both DNA and RNA, which is beyond the scope of existing DBP predictors or RBP predictors. The web-server of iDRBP_MMC is freely available at http://bliulab.net/iDRBP_MMC.

Original languageEnglish
Pages (from-to)5860-5875
Number of pages16
JournalJournal of Molecular Biology
Volume432
Issue number22
DOIs
Publication statusPublished - 6 Nov 2020

Keywords

  • cross-prediction problem
  • motif-based convolutional neural network
  • multi-label learning
  • nucleic acid binding protein prediction

Fingerprint

Dive into the research topics of 'iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network'. Together they form a unique fingerprint.

Cite this