TY - GEN
T1 - Non-Acoustic Speech Sensing System Based on Flexible Piezoelectric
AU - Yuan, Shiji
AU - Sun, Ying
AU - Wang, Shuai
AU - Chen, Xinlei
AU - Ding, Ying
AU - Zheng, Dezhi
AU - Fan, Shangchun
N1 - Publisher Copyright:
© 2022 Owner/Author.
PY - 2022/11/6
Y1 - 2022/11/6
N2 - Speech is one of the most important biological signals to complement human-human and human-computer interaction. Traditional speech datasets were collected by air microphones, but using these datasets in noisy environments such as factories is practically challenging. Therefore, speech recognition in noisy environments poses higher requirements. The non-acoustic speech dataset plays a significant role in robust speech recognition under high background noise. Existing datasets suffered from dull sound, low intelligibility and poor recognition accuracy due to hardware and computer technology limitations. This paper presents a non-acoustic speech sensing system based on flexible piezoelectric. The system collected vibration signals from the jaws of six males and five females, and the corpus contained ten different control commands at 90 dB of background noise. The dataset is reliable with high intelligibility and capable of achieving 93.7% recognition accuracy by calculation. With the aforementioned benefits, this dataset is an essential tool for studying human-computer interaction in high-noise environments, analyzing human acoustic properties, and aiding medical rehabilitation.
AB - Speech is one of the most important biological signals to complement human-human and human-computer interaction. Traditional speech datasets were collected by air microphones, but using these datasets in noisy environments such as factories is practically challenging. Therefore, speech recognition in noisy environments poses higher requirements. The non-acoustic speech dataset plays a significant role in robust speech recognition under high background noise. Existing datasets suffered from dull sound, low intelligibility and poor recognition accuracy due to hardware and computer technology limitations. This paper presents a non-acoustic speech sensing system based on flexible piezoelectric. The system collected vibration signals from the jaws of six males and five females, and the corpus contained ten different control commands at 90 dB of background noise. The dataset is reliable with high intelligibility and capable of achieving 93.7% recognition accuracy by calculation. With the aforementioned benefits, this dataset is an essential tool for studying human-computer interaction in high-noise environments, analyzing human acoustic properties, and aiding medical rehabilitation.
KW - flexible piezoelectric sensor
KW - non-acoustic speech
KW - speech command data
UR - http://www.scopus.com/inward/record.url?scp=85147549218&partnerID=8YFLogxK
U2 - 10.1145/3560905.3567768
DO - 10.1145/3560905.3567768
M3 - Conference contribution
AN - SCOPUS:85147549218
T3 - SenSys 2022 - Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems
SP - 1055
EP - 1060
BT - SenSys 2022 - Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems
PB - Association for Computing Machinery, Inc
T2 - 20th ACM Conference on Embedded Networked Sensor Systems, SenSys 2022
Y2 - 6 November 2022 through 9 November 2022
ER -