iDRBP-ECHF: Identifying DNA- and RNA-binding proteins based on extensible cubic hybrid framework

Jiawei Feng, Ning Wang, Jun Zhang, Bin Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Proteins interact with nucleic acids to regulate the life activities of organisms. Therefore, how to accurately and efficiently identify nucleic acid-binding proteins (NABPs) is particularly significant. Some sequence-based computational methods have been proposed to identify DNA- and RNA-binding proteins in previous studies. However, the benchmark datasets used by these methods ignore the proportion of NABPs in the real world, and some integration methods only integrate traditional machine learning algorithms, resulting in limited prediction performance. In this study, we proposed a sequence-based method called iDRBP-ECHF to predict the DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs). We constructed a benchmark dataset by considering the proportion of positive and negative samples in the real world, and used down-sampling to generate three relatively balanced datasets to train the iDRBP-ECHF. In addition, we incorporated the deep learning algorithms into the framework to obtain a more compact high-level feature representation of the input data. The results on two independent datasets show that it achieves the most advanced performance and is superior to the other existing sequence-based DBP and RBP prediction methods. In addition, we set up a webserver iDRBP-ECHF, which can be accessed at http://bliulab.net/iDRBP-ECHF.

Original languageEnglish
Article number105940
JournalComputers in Biology and Medicine
Volume149
DOIs
Publication statusPublished - Oct 2022

Keywords

  • DNA- and RNA-binding proteins identification
  • Extensible cubic hybrid framework
  • Machine learning
  • Multi-label learning

Fingerprint

Dive into the research topics of 'iDRBP-ECHF: Identifying DNA- and RNA-binding proteins based on extensible cubic hybrid framework'. Together they form a unique fingerprint.

Cite this