HearASL: Your Smartphone Can Hear American Sign Language

Yusen Wang; Fan Li; Yadong Xie; Chunhui Duan; Yu Wang

doi:10.1109/JIOT.2022.3232337

HearASL: Your Smartphone Can Hear American Sign Language

Yusen Wang, Fan Li^*, Yadong Xie, Chunhui Duan, Yu Wang

^*此作品的通讯作者

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Sign language is expressed by movements of the hands and facial expressions, which is mainly used by the deaf community. Although some gesture recognition methods are put forward, they possess different defects and are not applicable to deal with the sign language recognition (SLR) problem. In this article, we propose an end-to-end American SLR system with built-in speakers and microphones in smartphones, which enables SLR at both word level and sentence level. The high-level idea is to use the inaudible acoustic signal to estimate channel information and capture the sign language in real time. We use channel impulse response to represent each sign language gesture, which can realize finger-level recognition. We also pay attention to conversion movements between two words and treat them as an additional label when training the sentence-level classification model. We implement a prototype system and run a series of experiments that demonstrate the promising performance of our system. Experimental results show that our approach can achieve an accuracy of 97.2% at word-level recognition and word error rate of 0.9% at sentence-level recognition, respectively.

源语言	英语
页（从-至）	8839-8852
页数	14
期刊	IEEE Internet of Things Journal
卷	10
期	10
DOI	https://doi.org/10.1109/JIOT.2022.3232337
出版状态	已出版 - 15 5月 2023

访问文件

10.1109/JIOT.2022.3232337

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{8b88ee7a089b4e1eacdf3e110f8e3fad,

title = "HearASL: Your Smartphone Can Hear American Sign Language",

abstract = "Sign language is expressed by movements of the hands and facial expressions, which is mainly used by the deaf community. Although some gesture recognition methods are put forward, they possess different defects and are not applicable to deal with the sign language recognition (SLR) problem. In this article, we propose an end-to-end American SLR system with built-in speakers and microphones in smartphones, which enables SLR at both word level and sentence level. The high-level idea is to use the inaudible acoustic signal to estimate channel information and capture the sign language in real time. We use channel impulse response to represent each sign language gesture, which can realize finger-level recognition. We also pay attention to conversion movements between two words and treat them as an additional label when training the sentence-level classification model. We implement a prototype system and run a series of experiments that demonstrate the promising performance of our system. Experimental results show that our approach can achieve an accuracy of 97.2% at word-level recognition and word error rate of 0.9% at sentence-level recognition, respectively.",

keywords = "Acoustic sensing, American sign language (ASL), mobile computing",

author = "Yusen Wang and Fan Li and Yadong Xie and Chunhui Duan and Yu Wang",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2023",

month = may,

day = "15",

doi = "10.1109/JIOT.2022.3232337",

language = "English",

volume = "10",

pages = "8839--8852",

journal = "IEEE Internet of Things Journal",

issn = "2327-4662",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

TY - JOUR

T1 - HearASL

T2 - Your Smartphone Can Hear American Sign Language

AU - Wang, Yusen

AU - Li, Fan

AU - Xie, Yadong

AU - Duan, Chunhui

AU - Wang, Yu

PY - 2023/5/15

Y1 - 2023/5/15

N2 - Sign language is expressed by movements of the hands and facial expressions, which is mainly used by the deaf community. Although some gesture recognition methods are put forward, they possess different defects and are not applicable to deal with the sign language recognition (SLR) problem. In this article, we propose an end-to-end American SLR system with built-in speakers and microphones in smartphones, which enables SLR at both word level and sentence level. The high-level idea is to use the inaudible acoustic signal to estimate channel information and capture the sign language in real time. We use channel impulse response to represent each sign language gesture, which can realize finger-level recognition. We also pay attention to conversion movements between two words and treat them as an additional label when training the sentence-level classification model. We implement a prototype system and run a series of experiments that demonstrate the promising performance of our system. Experimental results show that our approach can achieve an accuracy of 97.2% at word-level recognition and word error rate of 0.9% at sentence-level recognition, respectively.

AB - Sign language is expressed by movements of the hands and facial expressions, which is mainly used by the deaf community. Although some gesture recognition methods are put forward, they possess different defects and are not applicable to deal with the sign language recognition (SLR) problem. In this article, we propose an end-to-end American SLR system with built-in speakers and microphones in smartphones, which enables SLR at both word level and sentence level. The high-level idea is to use the inaudible acoustic signal to estimate channel information and capture the sign language in real time. We use channel impulse response to represent each sign language gesture, which can realize finger-level recognition. We also pay attention to conversion movements between two words and treat them as an additional label when training the sentence-level classification model. We implement a prototype system and run a series of experiments that demonstrate the promising performance of our system. Experimental results show that our approach can achieve an accuracy of 97.2% at word-level recognition and word error rate of 0.9% at sentence-level recognition, respectively.

KW - Acoustic sensing

KW - American sign language (ASL)

KW - mobile computing

UR - http://www.scopus.com/inward/record.url?scp=85146251719&partnerID=8YFLogxK

U2 - 10.1109/JIOT.2022.3232337

DO - 10.1109/JIOT.2022.3232337

M3 - Article

AN - SCOPUS:85146251719

SN - 2327-4662

VL - 10

SP - 8839

EP - 8852

JO - IEEE Internet of Things Journal

JF - IEEE Internet of Things Journal

IS - 10

ER -

HearASL: Your Smartphone Can Hear American Sign Language

摘要

访问文件

其它文件与链接

指纹

引用此