Geometric Graph Learning for Predicting Protein Mutation Effect

Kangfei Zhao, Yu Rong, Biaobin Jiang, Jianheng Tang, Hengtong Zhang, Jeffrey Xu Yu, Peilin Zhao

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Proteins govern a wide range of biological systems. Evaluating the changes in protein properties upon protein mutation is a fundamental application of protein design, where modeling the 3D protein structure is a principal task for AI-driven computational approaches. Existing deep learning (DL) approaches represent the protein structure as a 3D geometric graph and simplify the graph modeling to different degrees, thereby failing to capture the low-level atom patterns and high-level amino acid patterns simultaneously. In addition, limited training samples with ground truth labels and protein structures further restrict the effectiveness of DL approaches. In this paper, we propose a new graph learning framework, Hierarchical Graph Invariant Network (HGIN), a fine-grained and data-efficient graph neural encoder for encoding protein structures and predicting the mutation effect on protein properties. For fine-grained modeling, HGIN hierarchically models the low-level interactions of atoms and the high-level interactions of amino acid residues by Graph Neural Networks. For data efficiency, HGIN preserves the invariant encoding for atom permutation and coordinate transformation, which is an intrinsic inductive bias of property prediction that bypasses data augmentations. We integrate HGIN into a Siamese network to predict the quantitative effect on protein properties upon mutations. Our approach outperforms 9 state-of-the-art approaches on 3 protein datasets. More inspiringly, when predicting the neutralizing ability of human antibodies against COVID-19 mutant viruses, HGIN achieves an absolute improvement of 0.23 regarding the Spearman coefficient.

源语言英语
主期刊名CIKM 2023 - Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
出版商Association for Computing Machinery
3412-3422
页数11
ISBN(电子版)9798400701245
DOI
出版状态已出版 - 21 10月 2023
活动32nd ACM International Conference on Information and Knowledge Management, CIKM 2023 - Birmingham, 英国
期限: 21 10月 202325 10月 2023

出版系列

姓名International Conference on Information and Knowledge Management, Proceedings

会议

会议32nd ACM International Conference on Information and Knowledge Management, CIKM 2023
国家/地区英国
Birmingham
时期21/10/2325/10/23

指纹

探究 'Geometric Graph Learning for Predicting Protein Mutation Effect' 的科研主题。它们共同构成独一无二的指纹。

引用此