Toward automatic audio description generation for accessible videos

Yujia Wang, Wei Liang

科研成果: 书/报告/会议事项章节会议稿件同行评审

39 引用 (Scopus)

摘要

Video accessibility is essential for people with visual impairments. Audio descriptions describe what is happening on-screen, e.g., physical actions, facial expressions, and scene changes. Generating highquality audio descriptions requires a lot of manual description generation [50]. To address this accessibility obstacle, we built a system that analyzes the audiovisual contents of a video and generates the audio descriptions. The system consisted of three modules: AD insertion time prediction, AD generation, and AD optimization. We evaluated the quality of our system on five types of videos by conducting qualitative studies with 20 sighted users and 12 users who were blind or visually impaired. Our findings revealed how audio description preferences varied with user types and video types. Based on our study's analysis, we provided recommendations for the development of future audio description generation technologies.

源语言英语
主期刊名CHI 2021 - Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems
主期刊副标题Making Waves, Combining Strengths
出版商Association for Computing Machinery
ISBN(电子版)9781450380966
DOI
出版状态已出版 - 6 5月 2021
活动10th International Conference on Materials Processing and Characterisation, ICMPC 2020 - Mathura, U.P., 印度
期限: 21 2月 202023 2月 2020

出版系列

姓名Conference on Human Factors in Computing Systems - Proceedings

会议

会议10th International Conference on Materials Processing and Characterisation, ICMPC 2020
国家/地区印度
Mathura, U.P.
时期21/02/2023/02/20

指纹

探究 'Toward automatic audio description generation for accessible videos' 的科研主题。它们共同构成独一无二的指纹。

引用此