Integrated Parallel System for Audio Conferencing Voice Transcription and Speaker Identification

Ke Miao, Oloff Biermann, Zhen Miao, Simon Leung, Jianhong Wang, Keke Gai

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

5 Citations (Scopus)

Abstract

In response to a request from a well-known international financial corporation, an integrated system prototype was architected and implemented to automatically record corporate audio conferences, transcribe the recordings to text while identifying speakers, and compile the transcription and identification results into text-based meeting minutes, which are then sent as meeting summary email attachments and saved to a meeting management database. This paper discusses three technology focuses of the integrated system: 1) selecting a third-party audio transcription and identification API (Audio API) through prototyping, factor comparison, and consideration of the corporation's existing technology environment; 2) optimizing the adoption of the selected Audio API based on knowledge of Natural Language Processing (NLP) methods; and 3) supporting asynchronous scheduling and processing of concurrent meetings using parallel computing architecture methods. The completed system was evaluated and shown to meet all of the corporation's requirements and to perform well in intelligent audio language processing and multi-threaded parallel execution. With further enhancements, we foresee that this system has good commercial value and the potential to be widely adopted by other businesses.
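The third focus, processing several meetings concurrently, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the Audio API or describe the scheduler, so `transcribe_stub`, `Segment`, `compile_minutes`, and `process_meetings` are hypothetical names, and the transcription step is a placeholder that returns fixed speaker-labelled text. The sketch assumes a thread pool is an acceptable stand-in for the multi-threaded execution the abstract mentions.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class Segment:
    """One speaker-labelled piece of transcribed speech (hypothetical type)."""
    speaker: str
    text: str


def transcribe_stub(recording_id: str) -> list[Segment]:
    """Hypothetical stand-in for the third-party Audio API call.

    A real system would upload the recording and wait for the service
    to return the transcript with speaker labels.
    """
    return [
        Segment("Speaker 1", f"Opening remarks for {recording_id}."),
        Segment("Speaker 2", "Action items noted."),
    ]


def compile_minutes(recording_id: str, segments: list[Segment]) -> str:
    """Format speaker-labelled segments as plain-text meeting minutes."""
    lines = [f"Minutes for {recording_id}"]
    lines += [f"{s.speaker}: {s.text}" for s in segments]
    return "\n".join(lines)


def process_meeting(recording_id: str) -> str:
    """Run the transcribe-then-compile pipeline for one meeting."""
    return compile_minutes(recording_id, transcribe_stub(recording_id))


def process_meetings(recording_ids: list[str], max_workers: int = 4) -> dict[str, str]:
    """Process several meetings concurrently, one worker thread per meeting."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as the input ids.
        results = pool.map(process_meeting, recording_ids)
        return dict(zip(recording_ids, results))
```

Threads suit this kind of workload because each worker would spend most of its time waiting on a remote service; the emailing and database steps described in the abstract are omitted here.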

Original language: English
Title of host publication: 2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728165110
Publication status: Published - May 2020
Event: 2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020 - Shenzhen, China
Duration: 23 May 2020 → …

Publication series

Name: 2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020

Conference

Conference: 2020 International Conference on High Performance Big Data and Intelligent Systems, HPBD and IS 2020
Country/Territory: China
City: Shenzhen
Period: 23/05/20 → …

Keywords

  • Integrated System
  • Multi-Threading
  • Natural Language Processing
  • Parallel Computing
  • Speaker Identification
  • Speech Transcription
  • Factor-Based Service Selection
