ANA-Mix: A Synthetic Corpus of Mandarin Speech in Airport Noise Conditions

  • Xiaoliang Wang*
  • , Yu Wang
  • , Ye Liu
  • , Xudong Zhou
  • , Fengming Liu
  • , Fengge Yu
  • , Shuai Zhang
  • , Guozheng Li
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents the Airport Noise-AISHELL Mix (ANA-Mix), a rich and realistic dataset tailored for advancing speech recognition and interactive systems in complex airport acoustic conditions. The noisy speech dataset is constructed by combining the publicly available AISHELL-3 Mandarin speech dataset with the environmental noise data actually collected at airports. The AISHELL-3 dataset provides a rich variety of high-quality sentence recordings, while the airport noise data captures a variety of typical airport noise scenarios, including crowd conversations, luggage rolling, and boarding announcements. A data mixing method is used to superimpose clean speech and randomly selected airport noise in waveforms to create 200,000 sets of noisy speech samples, including approximately 100,000 sets of single-person noisy speech and another 100,000 sets of multi-person (2~4 speakers) speech. This voice construction results are close to the actual deployment environment. The dataset constructed in this study can be used for a variety of tasks such as speech recognition, voiceprint recognition, and speech enhancement, demonstrating its potential value in improving the performance of voice interaction systems.

Original languageEnglish
Title of host publication2025 IEEE 3rd International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages98-102
Number of pages5
ISBN (Electronic)9798331503567
DOIs
Publication statusPublished - 2025
Event3rd IEEE International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025 - Jinzhou, China
Duration: 29 Aug 202531 Aug 2025

Publication series

Name2025 IEEE 3rd International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025

Conference

Conference3rd IEEE International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025
Country/TerritoryChina
CityJinzhou
Period29/08/2531/08/25

Keywords

  • airport noise
  • AISHELL-3
  • speech enhancement
  • speech recognition

Fingerprint

Dive into the research topics of 'ANA-Mix: A Synthetic Corpus of Mandarin Speech in Airport Noise Conditions'. Together they form a unique fingerprint.

Cite this