Self-Supervised Graph Information Bottleneck for Multiview Molecular Embedding Learning

Changsheng Li*, Kaihang Mao, Shiye Wang, Ye Yuan, Guoren Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

In the field of computer-aided drug discovery, identifying promising drug candidates from small molecule libraries requires meaningful molecular embeddings for downstream tasks, such as property prediction. However, obtaining experimentally determined molecular property measurements is often expensive and time-consuming, making it challenging to train molecular encoders with limited supervision. In addition, molecules can be represented in two ways: as 2-D chemical-bond structures and 3-D geometry structures. Molecular embedding learning using only one of these representations can result in information loss, and effective fusion of the two views has not been fully explored. To address these challenges, we propose a new approach called the self-supervised multiview graph neural network (SMV-GNN) for molecular embedding learning. Our approach involves a self-supervised task that promotes the representation ability of the molecular encoder without requiring extra human-annotation data. Specifically, we use chemical-bond-based graph structures as inputs to predict interatom distances from the 2-D view and randomly shuffle a ratio of atoms in the 3-D coordinate-based graphs to predict atom rationality from the 3-D view. We further improve the representation ability of the molecular embedding by using information bottleneck to learn essential shared feature representations by discarding superfluous information from the 2-D/3-D views for downstream tasks. We evaluate our proposed SMV-GNN approach on seven benchmark datasets for molecule property-prediction tasks, and demonstrate that it outperforms the current state-of-the-art methods.

源语言英语
页(从-至)1554-1562
页数9
期刊IEEE Transactions on Artificial Intelligence
5
4
DOI
出版状态已出版 - 1 4月 2024

指纹

探究 'Self-Supervised Graph Information Bottleneck for Multiview Molecular Embedding Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此