NCBRPred: Predicting nucleic acid binding residues in proteins based on multilabel learning

Jun Zhang, Qingcai Chen, Bin Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

35 Citations (Scopus)

Abstract

The interactions between proteins and nucleic acid sequences play many important roles in gene expression and some cellular activities. Accurate prediction of the nucleic acid binding residues in proteins will facilitate the research of the protein functions, gene expression, drug design, etc. In this regard, several computational methods have been proposed to predict the nucleic acid binding residues in proteins. However, these methods cannot satisfactorily measure the global interactions among the residues along protein. Furthermore, these methods are suffering cross-prediction problem, new strategies should be explored to solve this problem. In this study, a new computational method called NCBRPred was proposed to predict the nucleic acid binding residues based on the multilabel sequence labeling model. NCBRPred used the bidirectional Gated Recurrent Units (BiGRUs) to capture the global interactions among the residues, and treats this task as a multilabel learning task. Experimental results on three widely used benchmark datasets and an independent dataset showed that NCBRPred achieved higher predictive results with lower cross-prediction, outperforming 10 existing state-of-The-Art predictors. The web-server and a stand-Alone package of NCBRPred are freely available at http://bliulab.net/NCBRPred. It is anticipated that NCBRPred will become a very useful tool for identifying nucleic acid binding residues.

Original languageEnglish
Article numberbbaa397
JournalBriefings in Bioinformatics
Volume22
Issue number5
DOIs
Publication statusPublished - 1 Sept 2021
Externally publishedYes

Keywords

  • cross-prediction problem
  • multilabel learning
  • nucleic acid binding residue prediction
  • sequence labeling model

Fingerprint

Dive into the research topics of 'NCBRPred: Predicting nucleic acid binding residues in proteins based on multilabel learning'. Together they form a unique fingerprint.

Cite this