Protein Remote Homology Detection Based on Profiles

Qing Liao, Mingyue Guo, Bin Liu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

As a most important task in protein sequence analysis, protein remote homology detection has been extensively studied for decades. Currently, the profile-based methods show the state-of-the-art performance. Position-Specific Frequency Matrix (PSFM) is a widely used profile. The reason is that this profile contains evolutionary information, which is critical for protein sequence analysis. However, there exists noise information in the profiles introduced by the amino acids with low frequencies, which are not likely to occur in the corresponding sequence positions during evolutionary process. In this study, we propose one method to remove the noise information in the PSFM by removing the amino acids with low frequencies and two a profile can be generated, called Top frequency profile (TFP). Autocross covariance (ACC) transformation is performed on the profile to convert them into fixed length feature vectors. Combined with Support Vector Machines (SVMs), the predictor is constructed. Evaluated on a benchmark dataset, experimental results show that the proposed method outperforms other state-of-the-art predictors for protein remote homology detection, indicating that the proposed method is useful tools for protein sequence analysis. Because the profiles generated from multiple sequence alignments are important for protein structure and function prediction, the TFP will has many potential applications.

Original languageEnglish
Title of host publicationBioinformatics and Biomedical Engineering - 7th International Work-Conference, IWBBIO 2019, Proceedings
EditorsFernando Rojas, Francisco Ortuño, Olga Valenzuela, Francisco Ortuño, Ignacio Rojas
PublisherSpringer Verlag
Pages261-268
Number of pages8
ISBN (Print)9783030179373
DOIs
Publication statusPublished - 2019
Externally publishedYes
Event7th International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2019 - Granada, Spain
Duration: 8 May 201910 May 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11465 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2019
Country/TerritorySpain
CityGranada
Period8/05/1910/05/19

Keywords

  • Protein remote homology detection
  • Top Frequency Profile (TFP)

Fingerprint

Dive into the research topics of 'Protein Remote Homology Detection Based on Profiles'. Together they form a unique fingerprint.

Cite this