A method to improve structural modeling based on conserved domain clusters

Fa Zhang; Lin Xu; Bo Yuan

doi:10.1109/IPDPS.2006.1639540

A method to improve structural modeling based on conserved domain clusters

Fa Zhang^*, Lin Xu, Bo Yuan

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the Protein Data Bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the InterPro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.

Original language	English
Title of host publication	20th International Parallel and Distributed Processing Symposium, IPDPS 2006
Publisher	IEEE Computer Society
ISBN (Print)	1424400546, 9781424400546
DOIs	https://doi.org/10.1109/IPDPS.2006.1639540
Publication status	Published - 2006
Externally published	Yes
Event	20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006 - Rhodes Island, Greece Duration: 25 Apr 2006 → 29 Apr 2006

Publication series

Name	20th International Parallel and Distributed Processing Symposium, IPDPS 2006
Volume	2006

Conference

Conference	20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006
Country/Territory	Greece
City	Rhodes Island
Period	25/04/06 → 29/04/06

Access to Document

10.1109/IPDPS.2006.1639540

Cite this

@inproceedings{f1f20232c14b49a181006b24dc492a4b,

title = "A method to improve structural modeling based on conserved domain clusters",

abstract = "Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the Protein Data Bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the InterPro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.",

author = "Fa Zhang and Lin Xu and Bo Yuan",

year = "2006",

doi = "10.1109/IPDPS.2006.1639540",

language = "English",

isbn = "1424400546",

series = "20th International Parallel and Distributed Processing Symposium, IPDPS 2006",

publisher = "IEEE Computer Society",

booktitle = "20th International Parallel and Distributed Processing Symposium, IPDPS 2006",

address = "United States",

note = "20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006 ; Conference date: 25-04-2006 Through 29-04-2006",

}

Zhang, F, Xu, L & Yuan, B 2006, A method to improve structural modeling based on conserved domain clusters. in 20th International Parallel and Distributed Processing Symposium, IPDPS 2006., 1639540, 20th International Parallel and Distributed Processing Symposium, IPDPS 2006, vol. 2006, IEEE Computer Society, 20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006, Rhodes Island, Greece, 25/04/06. https://doi.org/10.1109/IPDPS.2006.1639540

A method to improve structural modeling based on conserved domain clusters. / Zhang, Fa; Xu, Lin; Yuan, Bo.
20th International Parallel and Distributed Processing Symposium, IPDPS 2006. IEEE Computer Society, 2006. 1639540 (20th International Parallel and Distributed Processing Symposium, IPDPS 2006; Vol. 2006).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A method to improve structural modeling based on conserved domain clusters

AU - Zhang, Fa

AU - Xu, Lin

AU - Yuan, Bo

PY - 2006

Y1 - 2006

N2 - Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the Protein Data Bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the InterPro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.

AB - Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the Protein Data Bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the InterPro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.

UR - http://www.scopus.com/inward/record.url?scp=33847130647&partnerID=8YFLogxK

U2 - 10.1109/IPDPS.2006.1639540

DO - 10.1109/IPDPS.2006.1639540

M3 - Conference contribution

AN - SCOPUS:33847130647

SN - 1424400546

SN - 9781424400546

T3 - 20th International Parallel and Distributed Processing Symposium, IPDPS 2006

BT - 20th International Parallel and Distributed Processing Symposium, IPDPS 2006

PB - IEEE Computer Society

T2 - 20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006

Y2 - 25 April 2006 through 29 April 2006

ER -

A method to improve structural modeling based on conserved domain clusters

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this