TY - GEN
T1 - A speech-video synchrony quality metric using COIA
AU - Wei, Yaodu
AU - Xie, Xiang
AU - Kuang, Jingming
AU - Han, Xinlu
PY - 2010
Y1 - 2010
N2 - A quality model was built to assess the influence of speechvideo asynchrony on the audio-visual quality perception. The audio-visual contents were separated into two categories: "speaker inside" and "speaker outside", depending on whether the speaker is inside the video. For the first category, speech was shifted in a small scale. DCT and MFCC coefficients were calculated from video and speech separately. A Co-inertia Analysis (CoIA) was used to decide the speech-video correlation, and as the speech progressively shifts, a correlation curve emerged. The curve was modeled by an Gaussian function, and then the function was used to predict the perceptual quality. On the other hand, a Gaussian curve was used to predict the perceptual quality of the "speaker outside" category. A subjective test proved the effectiveness of the proposed method.
AB - A quality model was built to assess the influence of speechvideo asynchrony on the audio-visual quality perception. The audio-visual contents were separated into two categories: "speaker inside" and "speaker outside", depending on whether the speaker is inside the video. For the first category, speech was shifted in a small scale. DCT and MFCC coefficients were calculated from video and speech separately. A Co-inertia Analysis (CoIA) was used to decide the speech-video correlation, and as the speech progressively shifts, a correlation curve emerged. The curve was modeled by an Gaussian function, and then the function was used to predict the perceptual quality. On the other hand, a Gaussian curve was used to predict the perceptual quality of the "speaker outside" category. A subjective test proved the effectiveness of the proposed method.
KW - Asynchrony
KW - Audio-visual quality
KW - Co-inertia analysis
KW - QVGA
KW - Speech
UR - http://www.scopus.com/inward/record.url?scp=79952365940&partnerID=8YFLogxK
U2 - 10.1109/PV.2010.5706835
DO - 10.1109/PV.2010.5706835
M3 - Conference contribution
AN - SCOPUS:79952365940
SN - 9781424495214
T3 - PV 2010 - 2010 18th International Packet Video Workshop
SP - 173
EP - 177
BT - PV 2010 - 2010 18th International Packet Video Workshop
T2 - 2010 18th International Packet Video Workshop, PV 2010
Y2 - 13 December 2010 through 14 December 2010
ER -