Integrated Parallel System for Audio Conferencing Voice Transcription and Speaker Identification

Ke Miao, Oloff Biermann, Zhen Miao, Simon Leung, Jianhong Wang, Keke Gai

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

In response to the request from a well-known international financial corporation, an integrated system prototype was architected and implemented to automatically record corporate audio conferencing, transcribe the recordings to text while identifying speakers, and compile the transcription and identification results into text-based meeting minutes, which then gets sent as meeting summary email attachments as well as saved into a meeting management database. Three technology focuses of this integrated system are discussed in this paper 1) Selection of a 3rd-party audio transcription and identification API (Audio API) through prototyping, factor comparison, and considering the existing technology environment at the corporation. 2) Optimize the adoption of the selected Audio API based on knowledge of Natural Language Process (NLP) methods. 3) Support asynchronous scheduling and processing of concurrent meetings using parallel computing architecture methods. The completed system was evaluated and shown to have met all the requirements from the corporation, perform well in audio language intelligent processing and multi-Threaded parallel execution. With further enhancements, we foresee this system solution has good commercial values and has potential to be adopted widely among other businesses.

源语言英语
主期刊名2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728165110
DOI
出版状态已出版 - 5月 2020
活动2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020 - Shenzhen, 中国
期限: 23 5月 2020 → …

出版系列

姓名2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020

会议

会议2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020
国家/地区中国
Shenzhen
时期23/05/20 → …

指纹

探究 'Integrated Parallel System for Audio Conferencing Voice Transcription and Speaker Identification' 的科研主题。它们共同构成独一无二的指纹。

引用此