Abstract
Short video platforms, such as TikTok, have gained immense popularity, but also contain harmful content like violence, threatening minors’ mental health. This paper proposes a Child-Attentive Multimodal Multitask Learning (CAMML) method for accurate violent short video detection. Unlike existing methods that neglect text cues, correlations with other harmful content, and children’s unique cognitive characteristics, CAMML integrates visual, auditory, and text modalities. It features a child-specific attention mechanism and a multi-task learning approach, jointly training violent video classification alongside tasks like detecting unpleasant and obscene content. Experiments on the MOB dataset that targets malicious and benign content in children’s videos demonstrate CAMML’s superior performance, achieving a 90.02% AUC. The method provides a robust solution for filtering violent content, fostering a clear online environment for children.
| Original language | English |
|---|---|
| Journal | Pacific Asia Conference on Information Systems |
| Publication status | Published - 2025 |
| Externally published | Yes |
| Event | 29th Pacific Asia Conference on Information Systems, PACIS 2025 - Kuala Lumpur, Malaysia Duration: 5 Jul 2025 → 9 Jul 2025 |
Keywords
- Children
- multimodality
- multitask
- video classification
- violence
Fingerprint
Dive into the research topics of 'Protecting Children from Violent Short Videos: A Child-Attentive Multimodal Multitask Learning Approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver