跳到主要导航 跳到搜索 跳到主要内容

Protein Fold Recognition by Combining Support Vector Machines and Pairwise Sequence Similarity Scores

  • Ke Yan
  • , Jie Wen
  • , Jin Xing Liu
  • , Yong Xu*
  • , Bin Liu*
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Protein fold recognition is one of the most essential steps for protein structure prediction, aiming to classify proteins into known protein folds. There are two main computational approaches: one is the template-based method based on the alignment scores between query-template protein pairs and the other is the machine learning method based on the feature representation and classifier. These two approaches have their own advantages and disadvantages. Can we combine these methods to establish more accurate predictors for protein fold recognition? In this study, we made an initial attempt and proposed two novel algorithms: TSVM-fold and ESVM-fold. TSVM-fold was based on the Support Vector Machines (SVMs), which utilizes a set of pairwise sequence similarity scores generated by three complementary template-based methods, including HHblits, SPARKS-X, and DeepFR. These scores measured the global relationships between query sequences and templates. The comprehensive features of the attributes of the sequences were fed into the SVMs for the prediction. Then the TSVM-fold was further combined with the HHblits algorithm so as to improve its generalization ability. The combined method is called ESVM-fold. Experimental results in two rigorous benchmark datasets (LE and YK datasets) showed that the proposed methods outperform some state-of-the-art methods, indicating that the TSVM-fold and ESVM-fold are efficient predictors for protein fold recognition.

源语言英语
页(从-至)2008-2016
页数9
期刊IEEE/ACM Transactions on Computational Biology and Bioinformatics
18
5
DOI
出版状态已出版 - 2021
已对外发布

指纹

探究 'Protein Fold Recognition by Combining Support Vector Machines and Pairwise Sequence Similarity Scores' 的科研主题。它们共同构成独一无二的指纹。

引用此