TY - JOUR
T1 - An improved profile-level domain linker propensity index for protein domain boundary prediction
AU - Zhang, Yanfeng
AU - Liu, Bin
AU - Dong, Qiwen
AU - Jin, Victor X.
PY - 2011/1
Y1 - 2011/1
N2 - Protein domain boundary prediction is critical for understanding protein structure and function. In this study, we present a novel method, an order profile domain linker propensity index (OPI), which uses the evolutionary information extracted from the protein sequence frequency profiles calculated from the multiple sequence alignments. A protein sequence is first converted into smooth and normalized numeric order profiles by OPI, from which the domain linkers can be predicted. By discriminating the different frequencies of the amino acids in the protein sequence frequency profiles, OPI clearly shows better performance than our previous method, a binary profile domain linker propensity index (PDLI). We tested our new method on two different datasets, SCOP-1 dataset and SCOP-2 dataset, and we were able to achieve a precision of 0.82 and 0.91 respectively. OPI also outperforms other residue-level, profile-level indexes as well as other state-of-the-art methods.
AB - Protein domain boundary prediction is critical for understanding protein structure and function. In this study, we present a novel method, an order profile domain linker propensity index (OPI), which uses the evolutionary information extracted from the protein sequence frequency profiles calculated from the multiple sequence alignments. A protein sequence is first converted into smooth and normalized numeric order profiles by OPI, from which the domain linkers can be predicted. By discriminating the different frequencies of the amino acids in the protein sequence frequency profiles, OPI clearly shows better performance than our previous method, a binary profile domain linker propensity index (PDLI). We tested our new method on two different datasets, SCOP-1 dataset and SCOP-2 dataset, and we were able to achieve a precision of 0.82 and 0.91 respectively. OPI also outperforms other residue-level, profile-level indexes as well as other state-of-the-art methods.
KW - Domain boundary
KW - Domain linker
KW - Multiple sequence alignments
KW - Sequence-based prediction
UR - http://www.scopus.com/inward/record.url?scp=78751536327&partnerID=8YFLogxK
U2 - 10.2174/092986611794328717
DO - 10.2174/092986611794328717
M3 - Article
C2 - 20955175
AN - SCOPUS:78751536327
SN - 0929-8665
VL - 18
SP - 7
EP - 16
JO - Protein and Peptide Letters
JF - Protein and Peptide Letters
IS - 1
ER -