A discriminative method for protein remote homology detection based on N-nary profiles

Bin Liu, Lei Lin, Xiaolong Wang, Qiwen Dong, Xuan Wang

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

Protein homology detection is a key problem in computational biology. In this paper, a novel building block for protein called N-nary profile which contains the evolutionary information of protein sequence frequency profiles has been presented. The protein sequence frequency profiles calculated from the multiple sequence alignments outputted by PSI-BLAST are converted into N-nary profiles. Such N-nary profiles are filtered by a feature selection algorithm called chi-square algorithm. The protein sequences are transformed into fixed-dimension feature vectors by the occurrence times of each N-nary profile and then the corresponding vectors are inputted to support vector machine (SVM). The latent semantic analysis (LSA) model, an efficient feature extraction algorithm, is adopted to further improve the performance of this method. When tested on the SCOP 1.53 data set, the prediction performance of N-nary profile method outperforms all compared methods of protein remote homology detection. The ROC50 score is 0.736, which is higher than the current best method for nearly 4 percent.

源语言英语
主期刊名Bioinformatics Research and Development - Second International Conference, BIRD 2008, Proceedings
出版商Springer Verlag
74-86
页数13
ISBN(印刷版)9783540705987
DOI
出版状态已出版 - 2008
已对外发布
活动2nd International Conference on Bioinformatics Research and Development, BIRD 2008 - Vienna, 奥地利
期限: 7 7月 20089 7月 2008

出版系列

姓名Communications in Computer and Information Science
13
ISSN(印刷版)1865-0929

会议

会议2nd International Conference on Bioinformatics Research and Development, BIRD 2008
国家/地区奥地利
Vienna
时期7/07/089/07/08

指纹

探究 'A discriminative method for protein remote homology detection based on N-nary profiles' 的科研主题。它们共同构成独一无二的指纹。

引用此