Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection

Linmei Hu; Ziwang Zhao; Weijian Qi; Xuemeng Song; Liqiang Nie

doi:10.1016/j.ins.2024.120310

Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection

Linmei Hu^*, Ziwang Zhao, Weijian Qi, Xuemeng Song, Liqiang Nie

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

Fake news often involves multimedia information such as text and image to mislead readers, proliferating and expanding its influence. Most existing fake news detection methods apply the co-attention mechanism to fuse multimodal features while ignoring the consistency of image and text in co-attention. In this paper, we propose multimodal matching-aware co-attention networks with mutual knowledge distillation for improving fake news detection. Specifically, we design an image-text matching-aware co-attention mechanism which captures the alignment of image and text for better multimodal fusion. The image-text matching representation can be obtained via a vision-language pre-trained model. Additionally, based on the designed image-text matching-aware co-attention mechanism, we propose to build two co-attention networks respectively centered on text and image for mutual knowledge distillation to improve fake news detection. Extensive experiments on three benchmark datasets demonstrate that our proposed model outperforms existing methods on multimodal fake news detection.

Original language	English
Article number	120310
Journal	Information Sciences
Volume	664
DOIs	https://doi.org/10.1016/j.ins.2024.120310
Publication status	Published - Apr 2024

Keywords

Fake news detection
Image-text matching
Mutual knowledge distillation

Access to Document

10.1016/j.ins.2024.120310

Cite this

@article{71991d2960934ffe86d653ede2b3fbdc,

title = "Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection",

abstract = "Fake news often involves multimedia information such as text and image to mislead readers, proliferating and expanding its influence. Most existing fake news detection methods apply the co-attention mechanism to fuse multimodal features while ignoring the consistency of image and text in co-attention. In this paper, we propose multimodal matching-aware co-attention networks with mutual knowledge distillation for improving fake news detection. Specifically, we design an image-text matching-aware co-attention mechanism which captures the alignment of image and text for better multimodal fusion. The image-text matching representation can be obtained via a vision-language pre-trained model. Additionally, based on the designed image-text matching-aware co-attention mechanism, we propose to build two co-attention networks respectively centered on text and image for mutual knowledge distillation to improve fake news detection. Extensive experiments on three benchmark datasets demonstrate that our proposed model outperforms existing methods on multimodal fake news detection.",

keywords = "Fake news detection, Image-text matching, Mutual knowledge distillation",

author = "Linmei Hu and Ziwang Zhao and Weijian Qi and Xuemeng Song and Liqiang Nie",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Inc.",

year = "2024",

month = apr,

doi = "10.1016/j.ins.2024.120310",

language = "English",

volume = "664",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection

AU - Hu, Linmei

AU - Zhao, Ziwang

AU - Qi, Weijian

AU - Song, Xuemeng

AU - Nie, Liqiang

PY - 2024/4

Y1 - 2024/4

N2 - Fake news often involves multimedia information such as text and image to mislead readers, proliferating and expanding its influence. Most existing fake news detection methods apply the co-attention mechanism to fuse multimodal features while ignoring the consistency of image and text in co-attention. In this paper, we propose multimodal matching-aware co-attention networks with mutual knowledge distillation for improving fake news detection. Specifically, we design an image-text matching-aware co-attention mechanism which captures the alignment of image and text for better multimodal fusion. The image-text matching representation can be obtained via a vision-language pre-trained model. Additionally, based on the designed image-text matching-aware co-attention mechanism, we propose to build two co-attention networks respectively centered on text and image for mutual knowledge distillation to improve fake news detection. Extensive experiments on three benchmark datasets demonstrate that our proposed model outperforms existing methods on multimodal fake news detection.

AB - Fake news often involves multimedia information such as text and image to mislead readers, proliferating and expanding its influence. Most existing fake news detection methods apply the co-attention mechanism to fuse multimodal features while ignoring the consistency of image and text in co-attention. In this paper, we propose multimodal matching-aware co-attention networks with mutual knowledge distillation for improving fake news detection. Specifically, we design an image-text matching-aware co-attention mechanism which captures the alignment of image and text for better multimodal fusion. The image-text matching representation can be obtained via a vision-language pre-trained model. Additionally, based on the designed image-text matching-aware co-attention mechanism, we propose to build two co-attention networks respectively centered on text and image for mutual knowledge distillation to improve fake news detection. Extensive experiments on three benchmark datasets demonstrate that our proposed model outperforms existing methods on multimodal fake news detection.

KW - Fake news detection

KW - Image-text matching

KW - Mutual knowledge distillation

UR - http://www.scopus.com/inward/record.url?scp=85186266627&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2024.120310

DO - 10.1016/j.ins.2024.120310

M3 - Article

AN - SCOPUS:85186266627

SN - 0020-0255

VL - 664

JO - Information Sciences

JF - Information Sciences

M1 - 120310

ER -

Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this