SOFM-top: Protein remote homology detection and fold recognition based on sequence-order frequency matrix

Junjie Chen, Mingyue Guo, Xiaolong Wang, Bin Liu*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Protein remote homology detection and fold recognition are critical for the studies of protein structure and function. Currently, the profile-based methods showed the state-of-the-art performance in this field, which are based on widely used sequence profiles, such as Position-Specific Frequency Matrix (PSFM) and Position-Specific Scoring Matrix (PSSM). However, these approaches ignore the sequence-order effects along protein sequence. In this study, we proposed a novel profile, called Sequence-Order Frequency Matrix (SOFM), which can incorporate the sequence-order information and extract the evolutionary information from Multiple Sequence Alignment (MSA). Statistical tests and experimental results demonstrated its effects. Combined with a previously proposed approach Top-n-grams, the SOFM was then applied to remote homology detection and fold recognition, and a computational predictor called SOFM-Top was proposed. Evaluated on four benchmark datasets, it outperformed other state-of-the-art methods in this filed, indicating that SOFM-Top would be a more useful tool, and SOFM is a richer representation than PSFM and PSSM. SOFM will have many potential applications since profiles have been widely used for constructing computational predictors in the studies of protein structure and function.

源语言英语
主期刊名Intelligent Computing Theories and Application - 13th International Conference, ICIC 2017, Proceedings
编辑De-Shuang Huang, Kang-Hyun Jo, Juan Carlos Figueroa-Garcia
出版商Springer Verlag
469-480
页数12
ISBN(印刷版)9783319633114
DOI
出版状态已出版 - 2017
已对外发布
活动13th International Conference on Intelligent Computing, ICIC 2017 - Liverpool, 英国
期限: 7 8月 201710 8月 2017

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
10362 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议13th International Conference on Intelligent Computing, ICIC 2017
国家/地区英国
Liverpool
时期7/08/1710/08/17

指纹

探究 'SOFM-top: Protein remote homology detection and fold recognition based on sequence-order frequency matrix' 的科研主题。它们共同构成独一无二的指纹。

引用此