Detecting depression in speech: Comparison and combination between different speech types

Hailiang Long, Zhenghao Guo, Xia Wu, Bin Hu*, Zhenyu Liu, Hanshu Cai

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

30 Citations (Scopus)

Abstract

Depression is a highly prevalent mental disorder that negatively affects individuals, their families, society, and the economy. In recent years, automatic detection of depression from the speech signal has attracted growing interest. In this paper, a new multiple classifier system for depression recognition was developed and tested. The novel aspect of this methodology is the combination of different speech types and emotions. First, using a sample of 74 subjects (37 depressed patients and 37 healthy controls), we examined the discriminative power of different speech types (interview, picture description, and reading) and speech emotions (positive, neutral, and negative). A range of voice features was tested, including short-time energy, intensity, loudness, zero-crossing rate (ZCR), F0, jitter, shimmer, formants, mel-frequency cepstral coefficients (MFCC), linear prediction coefficients (LPC), line spectrum pairs (LSP), and perceptual linear predictive coefficients (PLP). Then, a new multiple classifier method was proposed to detect depression. The overall recognition rate using interview speech was higher than that using picture description speech or reading speech. Furthermore, neutral speech showed better performance than positive and negative speech. Among the features tested, short-time energy, ZCR, LPC, MFCC, and LSP were robust, giving high accuracy across different types of speech. Finally, the new approach achieved an accuracy of 78.02%, an encouraging result for detecting depression in speech.
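
Two of the features named in the abstract, short-time energy and zero-crossing rate, have simple frame-level definitions, and the idea of combining decisions from several speech types can be illustrated with a plain majority vote. The following is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the framing parameters, the classifiers, or the fusion rule, so the function names, the frame length and hop arguments, and the majority-vote rule are all hypothetical choices made for illustration.

```python
from typing import List, Sequence


def frames(signal: Sequence[float], frame_len: int, hop: int) -> List[Sequence[float]]:
    """Split a signal into overlapping frames, dropping any incomplete tail."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def short_time_energy(frame: Sequence[float]) -> float:
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame: Sequence[float]) -> float:
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def majority_vote(labels: Sequence[int]) -> int:
    """Fuse binary decisions (1 = depressed, 0 = control); ties go to 0.

    Assumption: one decision per speech type (e.g. interview, picture
    description, reading). The paper's actual fusion rule is not given
    in the abstract.
    """
    return 1 if sum(labels) * 2 > len(labels) else 0
```

In such a sketch, per-frame features would be computed for each recording, summarized per subject, and passed to one classifier per speech type, with `majority_vote` combining the resulting labels. A weighted or probability-based fusion would be an equally plausible choice.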

Original language: English
Title of host publication: Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
Editors: Illhoi Yoo, Jane Huiru Zheng, Yang Gong, Xiaohua Tony Hu, Chi-Ren Shyu, Yana Bromberg, Jean Gao, Dmitry Korkin
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1052-1058
Number of pages: 7
ISBN (Electronic): 9781509030491
Publication status: Published - 15 Dec 2017
Externally published: Yes
Event: 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017 - Kansas City, United States
Duration: 13 Nov 2017 → 16 Nov 2017

Publication series

Name: Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
Volume: 2017-January

Conference

Conference: 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
Country/Territory: United States
City: Kansas City
Period: 13/11/17 → 16/11/17

Keywords

  • depression
  • multiple classifier system
  • speech emotions
  • speech types
  • voice features
