A Multi-Scale Low-Bitrate Speech Codec for LEO Communication With Deep Feature Preservation Losses

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper introduces a low-bitrate neural speech codec specifically designed to address the bandwidth limitations of Low Earth Orbit satellite communication. To tackle the challenge of ensuring speech quality under such strict bitrate and bandwidth constraints, we propose a novel framework that effectively balances compression and perceptual quality. The first innovation is a multi-scale residual network that significantly enhances coding efficiency while maintaining low complexity. The second innovation involves a set of deep feature preservation losses that improve the perceptual naturalness of reconstructed speech. Experimental results demonstrate that our approach achieves high-quality speech reconstruction at a bitrate of just 1.5 kbps with a 24 kHz sampling rate, making it highly suitable for low-bitrate speech transmission in LEO communication scenarios.

Original languageEnglish
Title of host publication2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665478014
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025 - Shanghai, China
Duration: 10 Aug 202513 Aug 2025

Publication series

Name2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025

Conference

Conference2025 IEEE/CIC International Conference on Communications in China, ICCC Workshops 2025
Country/TerritoryChina
CityShanghai
Period10/08/2513/08/25

Keywords

  • LEO communication
  • feature loss
  • low bitrate speech coding
  • residual block

Fingerprint

Dive into the research topics of 'A Multi-Scale Low-Bitrate Speech Codec for LEO Communication With Deep Feature Preservation Losses'. Together they form a unique fingerprint.

Cite this