Abstract
As an important task in protein structure and function studies, protein fold recognition has attracted more and more attention. The existing computational predictors in this field treat this task as a multi-classification problem, ignoring the relationship among proteins in the dataset. However, previous studies showed that their relationship is critical for protein homology analysis. In this study, the protein fold recognition is treated as an information retrieval task. The Learning to Rank model (LTR) was employed to retrieve the query protein against the template proteins to find the template proteins in the same fold with the query protein in a supervised manner. The triadic closure principle (TCP) was performed on the ranking list generated by the LTR to improve its accuracy by considering the relationship among the query protein and the template proteins in the ranking list. Finally, a predictor called Fold-LTR-TCP was proposed. The rigorous test on the LE benchmark dataset showed that the Fold-LTR-TCP predictor achieved an accuracy of 73.2%, outperforming all the other competing methods.
Original language | English |
---|---|
Pages (from-to) | 2185-2193 |
Number of pages | 9 |
Journal | Briefings in Bioinformatics |
Volume | 21 |
Issue number | 6 |
DOIs | |
Publication status | Published - 1 Nov 2020 |
Keywords
- Feature mapping strategy
- Learning to Rank
- Protein fold recognition
- Triadic closure principle