TY - JOUR
T1 - Joint masking and self-supervised strategies for inferring small molecule-miRNA associations
AU - Zhou, Zhecheng
AU - Zhuo, Linlin
AU - Fu, Xiangzheng
AU - Lv, Juan
AU - Zou, Quan
AU - Qi, Ren
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2024/3/12
Y1 - 2024/3/12
N2 - Inferring small molecule-miRNA associations (MMAs) is crucial for revealing the intricacies of biological processes and disease mechanisms. Deep learning, renowned for its exceptional speed and accuracy, is extensively used for predicting MMAs. However, given their heavy reliance on data, inaccuracies during data collection can make these methods susceptible to noise interference. To address this challenge, we introduce the joint masking and self-supervised (JMSS)-MMA model. This model synergizes graph autoencoders with a probability distribution-based masking strategy, effectively countering the impact of noisy data and enabling precise predictions of unknown MMAs. Operating in a self-supervised manner, it deeply encodes the relationship data of small molecules and miRNA through the graph autoencoder, delving into its latent information. Our masking strategy has successfully reduced data noise, enhancing prediction accuracy. To our knowledge, this is the pioneering integration of a masking strategy with graph autoencoders for MMA prediction. Furthermore, the JMSS-MMA model incorporates a node-degree–based decoder, deepening the understanding of the network's structure. Experiments on two mainstream datasets confirm the model's efficiency and precision, and ablation studies further attest to its robustness. We firmly believe that this model will revolutionize drug development, personalized medicine, and biomedical research.
AB - Inferring small molecule-miRNA associations (MMAs) is crucial for revealing the intricacies of biological processes and disease mechanisms. Deep learning, renowned for its exceptional speed and accuracy, is extensively used for predicting MMAs. However, given their heavy reliance on data, inaccuracies during data collection can make these methods susceptible to noise interference. To address this challenge, we introduce the joint masking and self-supervised (JMSS)-MMA model. This model synergizes graph autoencoders with a probability distribution-based masking strategy, effectively countering the impact of noisy data and enabling precise predictions of unknown MMAs. Operating in a self-supervised manner, it deeply encodes the relationship data of small molecules and miRNA through the graph autoencoder, delving into its latent information. Our masking strategy has successfully reduced data noise, enhancing prediction accuracy. To our knowledge, this is the pioneering integration of a masking strategy with graph autoencoders for MMA prediction. Furthermore, the JMSS-MMA model incorporates a node-degree–based decoder, deepening the understanding of the network's structure. Experiments on two mainstream datasets confirm the model's efficiency and precision, and ablation studies further attest to its robustness. We firmly believe that this model will revolutionize drug development, personalized medicine, and biomedical research.
KW - MT: Bioinformatics
KW - decoder based on node degree
KW - graph masked autoencoder
KW - probability distribution-based masking strategy
KW - self-supervised strategy
KW - small-molecule-miRNA associations
UR - https://www.scopus.com/pages/publications/85181720634
U2 - 10.1016/j.omtn.2023.102103
DO - 10.1016/j.omtn.2023.102103
M3 - Article
AN - SCOPUS:85181720634
SN - 2162-2531
VL - 35
JO - Molecular Therapy Nucleic Acids
JF - Molecular Therapy Nucleic Acids
IS - 1
M1 - 102103
ER -