
A new approach to modeling excitation in very low-rate speech coding

  • Queensland University of Technology

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

8 Scopus citations

Abstract

A new method is proposed for the two-band approximation of excitation signals in an LPC model, with the aim of improving speech naturalness in very low-rate coding. Based on a simplified model of multi-band excitation, the method accurately determines the degree of periodicity using instantaneous frequency (IF) estimation in the frequency domain. The harmonic structure in the spectrum of the LPC residual is identified within individual bands, using the flatness of the IF as the criterion for pitch and voicing detection. On this basis, the excitation is modelled by combining a predefined periodic signal in the lower band with a random signal in the higher band. This is shown to considerably improve the naturalness of reconstructed speech in very low-rate coding compared with traditional binary excitation. The performance of the technique is also reported for temporal decomposition (TD) based coding at 800 b/s.
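The two-band excitation idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the shape of the periodic signal, the band gains, or how the band boundary is chosen, so the band-limited pulse train, the random-phase noise band, the unit amplitudes, and the names `two_band_excitation`, `pitch_period`, and `cutoff_hz` are all assumptions made here for illustration. The IF-based periodicity analysis is not reproduced; the cutoff is simply passed in.

```python
import math
import random


def two_band_excitation(n_samples, pitch_period, cutoff_hz, sample_rate=8000, seed=0):
    """Return (low, high, mixed): harmonics up to cutoff_hz, noise-like band above.

    Hypothetical sketch: signal shapes and gains are assumptions, not the paper's.
    """
    rng = random.Random(seed)
    f0 = sample_rate / pitch_period   # fundamental frequency in Hz
    n_harm = int(cutoff_hz // f0)     # number of harmonics at or below the cutoff

    # Lower band: equal-amplitude cosine harmonics, i.e. a band-limited pulse train.
    low = [
        sum(math.cos(2 * math.pi * k * f0 * n / sample_rate) for k in range(1, n_harm + 1))
        for n in range(n_samples)
    ]

    # Upper band: one random-phase cosine per DFT bin strictly above the cutoff
    # and below the Nyquist bin, which gives a noise-like band-limited signal.
    first_bin = int(cutoff_hz * n_samples // sample_rate) + 1
    phases = {k: rng.uniform(0.0, 2 * math.pi) for k in range(first_bin, n_samples // 2)}
    high = [
        sum(math.cos(2 * math.pi * k * n / n_samples + ph) for k, ph in phases.items())
        for n in range(n_samples)
    ]

    # Full excitation: sample-wise sum of the two bands (unit gains assumed).
    mixed = [a + b for a, b in zip(low, high)]
    return low, high, mixed


# Example: 20 ms frame at 8 kHz, 100 Hz pitch, band boundary at 1 kHz.
low, high, mixed = two_band_excitation(n_samples=160, pitch_period=80, cutoff_hz=1000)
```

In a coder of this kind, `mixed` would then drive the LPC synthesis filter. A fully voiced or fully unvoiced frame corresponds to moving `cutoff_hz` to the top or bottom of the band, which is how this scheme generalises a binary voiced/unvoiced decision.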

Original language: English
Title of host publication: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998
Pages: 597-600
Number of pages: 4
State: Published - 1998
Externally published: Yes
Event: 1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998 - Seattle, WA, United States
Duration: 12 May 1998 to 15 May 1998

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2
ISSN (Print): 1520-6149

Conference

Conference: 1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998
Country/Territory: United States
City: Seattle, WA
Period: 12/05/98 to 15/05/98
