A Novel Audio-Oriented Learning Strategies for Character Recognition

Changbin Lu, Guangyu Gao*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this paper, we propose a robust audio-oriented learning strategies to address the issue of character recognition in movie/TV-series. Identifying major characters in movies/TV-series has drawn researcher's great interests. Most of them have explored some character recognition and retrieval applications based on visual appearance, whereas visual appearance is inconsistent throughout the whole video. Our approach, mainly focusing on audio, features that: (i) we extract both spectral and temporal audio features of Mel-scale Frequency Cepstral Coefficients(MFCC), prosodic, average pause length, speaking rate features, pitch and short time energy, and also the complementarity of Gabor features, (ii) we adopt Multi-Task Joint Sparse Representation and Recognition (MTJSRC) model for learning with all the features except Gabor, and SVM model with Gabor features, (iii) regarding these original features as seeds, we extend the training set from talk shows with semi-supervise learning, (iv) the Conditional Random Field (CRF) model with consideration of the constrains in time sequence is introduced to enhance the final labelling. Finally, experimental results demonstrates the effectiveness performance of our approach.

源语言英语
主期刊名Proceedings - 2016 International Conference on Virtual Reality and Visualization, ICVRV 2016
编辑Dandan Ding, Dangxiao Wang, Jian Chen, Xun Luo
出版商Institute of Electrical and Electronics Engineers Inc.
459-464
页数6
ISBN(电子版)9781509051885
DOI
出版状态已出版 - 1 6月 2017
活动6th International Conference on Virtual Reality and Visualization, ICVRV 2016 - Hangzhou, Zhejiang, 中国
期限: 24 9月 201626 9月 2016

出版系列

姓名Proceedings - 2016 International Conference on Virtual Reality and Visualization, ICVRV 2016

会议

会议6th International Conference on Virtual Reality and Visualization, ICVRV 2016
国家/地区中国
Hangzhou, Zhejiang
时期24/09/1626/09/16

指纹

探究 'A Novel Audio-Oriented Learning Strategies for Character Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此