Protecting Children from Violent Short Videos: A Child-Attentive Multimodal Multitask Learning Approach

  • Chenxing Zhao
  • , Liang Yang
  • , Junwei Kuang
  • , Zhijun Yan*
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

Short video platforms, such as TikTok, have gained immense popularity, but also contain harmful content like violence, threatening minors’ mental health. This paper proposes a Child-Attentive Multimodal Multitask Learning (CAMML) method for accurate violent short video detection. Unlike existing methods that neglect text cues, correlations with other harmful content, and children’s unique cognitive characteristics, CAMML integrates visual, auditory, and text modalities. It features a child-specific attention mechanism and a multi-task learning approach, jointly training violent video classification alongside tasks like detecting unpleasant and obscene content. Experiments on the MOB dataset that targets malicious and benign content in children’s videos demonstrate CAMML’s superior performance, achieving a 90.02% AUC. The method provides a robust solution for filtering violent content, fostering a clear online environment for children.

Original languageEnglish
JournalPacific Asia Conference on Information Systems
Publication statusPublished - 2025
Externally publishedYes
Event29th Pacific Asia Conference on Information Systems, PACIS 2025 - Kuala Lumpur, Malaysia
Duration: 5 Jul 20259 Jul 2025

Keywords

  • Children
  • multimodality
  • multitask
  • video classification
  • violence

Fingerprint

Dive into the research topics of 'Protecting Children from Violent Short Videos: A Child-Attentive Multimodal Multitask Learning Approach'. Together they form a unique fingerprint.

Cite this