Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data

Zepeng Li, Jiawei Zhou, Zhengyi An, Wenchuan Cheng, Bin Hu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide detection, but related research still faces some difficulties, such as data imbalance and expression implicitness. In this paper, we propose a Deep Hierarchical Ensemble model for Suicide Detection (DHE-SD) based on a hierarchical ensemble strategy, and construct a dataset based on Sina Weibo, which contains more than 550 thousand posts from 4521 users. To verify the effectiveness of the model, we also conduct experiments on a public Weibo dataset containing 7329 users’ posts. The proposed model achieves the best performance on both the constructed dataset and the public dataset. In addition, in order to make the model applicable to a wider population, we use the proposed sentence-level mask mechanism to delete user posts with strong suicidal ideation. Experiments show that the proposed model can still effectively identify social media users with suicidal ideation even when the performance of the baseline models decrease significantly.

Original languageEnglish
Article number442
JournalEntropy
Volume24
Issue number4
DOIs
Publication statusPublished - Apr 2022

Keywords

  • China
  • Sina Weibo
  • deep neural network
  • imbalanced data
  • social media
  • suicide ideation detection

Fingerprint

Dive into the research topics of 'Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data'. Together they form a unique fingerprint.

Cite this