Protein remote homology detection by combining pseudo Dimer composition with an ensemble learning method

Bin Liu*, Junjie Chen, Shanyi Wang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

9 Citations (Scopus)

Abstract

Background: With the development of next-generation sequencing techniques, the volume of protein sequence data is growing exponentially, whereas protein structure data is growing slowly, and the gap between the two keeps widening. Protein remote homology detection has therefore become an important and intensively studied research problem. Objective: Although several methods have been reported to tackle this problem, their performance is still too low for real-world application. It is therefore necessary to characterize protein sequences from a new perspective in order to improve the predictive performance of protein remote homology detection. Method: In this study, we propose a new protein feature called Pseudo Dimer Composition (PDC). A new computational method for protein remote homology detection, called PDC-Ensemble, was constructed by combining PDC with an ensemble learning approach. Result: Experimental results on a public benchmark dataset showed that PDC-Ensemble outperformed other sequence-based methods and is highly comparable with some state-of-the-art predictors in the field of protein remote homology detection. Conclusion: PDC can extract more dipeptide information, and PDC-Ensemble is a useful tool for studies of protein remote homology detection.
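The abstract does not define how PDC is computed or which ensemble strategy PDC-Ensemble uses, so the following is only a minimal sketch of the two general ideas it builds on. It shows plain dimer (dipeptide) composition, not PDC itself, and an unweighted average of classifier scores as one assumed way of combining predictors. The names `dimer_composition` and `average_scores` are hypothetical and do not come from the paper.

```python
from itertools import product

# The 20 standard amino acids, in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# All 400 ordered residue pairs (dimers) and their vector positions.
DIMERS = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]
DIMER_INDEX = {dimer: i for i, dimer in enumerate(DIMERS)}


def dimer_composition(sequence):
    """Return the normalized frequencies of the 400 adjacent residue pairs.

    This is plain dipeptide composition, used here only to illustrate the
    kind of information a dimer-based feature captures. It is NOT the
    Pseudo Dimer Composition described in the paper.
    """
    seq = sequence.upper()
    counts = [0] * len(DIMERS)
    total = 0
    for first, second in zip(seq, seq[1:]):
        idx = DIMER_INDEX.get(first + second)
        if idx is not None:  # skip pairs containing non-standard residues
            counts[idx] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIMERS)
    return [count / total for count in counts]


def average_scores(score_lists):
    """Combine per-classifier scores by unweighted averaging.

    Assumption: averaging is one common ensemble strategy; the abstract
    does not say which strategy PDC-Ensemble actually uses.
    """
    n_classifiers = len(score_lists)
    return [sum(scores) / n_classifiers for scores in zip(*score_lists)]


if __name__ == "__main__":
    features = dimer_composition("ACAC")
    print(len(features), features[DIMER_INDEX["AC"]])
    print(average_scores([[0.2, 0.8], [0.4, 0.6]]))
```

Each sequence maps to a fixed-length 400-dimensional vector regardless of its length, which is what lets such features feed standard classifiers whose scores can then be combined.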

Original language: English
Pages (from-to): 86-91
Number of pages: 6
Journal: Current Proteomics
Volume: 13
Issue number: 2
Publication status: Published - 1 Jun 2016
Externally published: Yes

Keywords

  • Ensemble learning
  • Protein remote homology detection
  • Pseudo Dimer composition
