SMI-BLAST: A novel supervised search framework based on PSI-BLAST for protein remote homology detection

Xiaopeng Jin, Qing Liao, Hang Wei, Jun Zhang, Bin Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

24 Citations (Scopus)

Abstract

Motivation: As one of the most important and widely used mainstream iterative search tool for protein sequence search, an accurate Position-Specific Scoring Matrix (PSSM) is the key of PSI-BLAST. However, PSSMs containing non-homologous information obviously reduce the performance of PSI-BLAST for protein remote homology. Results: To further study this problem, we summarize three types of Incorrectly Selected Homology (ISH) errors in PSSMs. A new search tool Supervised-Manner-based Iterative BLAST (SMI-BLAST) is proposed based on PSI-BLAST for solving these errors. SMI-BLAST obviously outperforms PSI-BLAST on the Structural Classification of Proteins-extended (SCOPe) dataset. Compared with PSI-BLAST on the ISH error subsets of SCOPe dataset, SMI-BLAST detects 1.6-2.87 folds more remote homologous sequences, and outperforms PSI-BLAST by 35.66% in terms of ROC1 scores. Furthermore, this framework is applied to JackHMMER, DELTA-BLAST and PSI-BLASTexB, and their performance is further improved.

Original languageEnglish
Pages (from-to)913-920
Number of pages8
JournalBioinformatics
Volume37
Issue number7
DOIs
Publication statusPublished - 1 Apr 2021

Fingerprint

Dive into the research topics of 'SMI-BLAST: A novel supervised search framework based on PSI-BLAST for protein remote homology detection'. Together they form a unique fingerprint.

Cite this