TY - GEN
T1 - A hybrid LP-harmonics model for low bit-rate speech compression with natural quality
AU - Abu-Shikhah, Nazih
AU - Deriche, Mohamed
PY - 2001
Y1 - 2001
N2 - We present a new LP-harmonic based speech codec. At the coder the speech signal is pre-processed, and an LP analysis is carried out together with pitch estimation and voicing decision. At the decoder and when the frame is voiced, the encoded parameters are used to estimate the spectrum envelope, and extract and classify the harmonies as either strong or weak depending on their relative distance from multiples of the fundamental frequency. Strong harmonics parameters are then used to generate pure sinusoids, while weak harmonics are used to generate a mixed signal of a pure sinusoid and a random like signal. For unvoiced frames, the excitation for the LP filter is expressed as a white noise signal. The proposed model allows for the mixing of strong and weak periodic signals together with random signals to produce an excitation input that results in natural speech. Informal testing of two versions of the coder operating at 1.82, and 2.56 kbit/s showed that the output speech has high intelligibility, with quality comparable to that of a 4 kbit/s sinusoidal codec.
AB - We present a new LP-harmonic based speech codec. At the coder the speech signal is pre-processed, and an LP analysis is carried out together with pitch estimation and voicing decision. At the decoder and when the frame is voiced, the encoded parameters are used to estimate the spectrum envelope, and extract and classify the harmonies as either strong or weak depending on their relative distance from multiples of the fundamental frequency. Strong harmonics parameters are then used to generate pure sinusoids, while weak harmonics are used to generate a mixed signal of a pure sinusoid and a random like signal. For unvoiced frames, the excitation for the LP filter is expressed as a white noise signal. The proposed model allows for the mixing of strong and weak periodic signals together with random signals to produce an excitation input that results in natural speech. Informal testing of two versions of the coder operating at 1.82, and 2.56 kbit/s showed that the output speech has high intelligibility, with quality comparable to that of a 4 kbit/s sinusoidal codec.
UR - https://www.scopus.com/pages/publications/84904285024
U2 - 10.1109/ISSPA.2001.950250
DO - 10.1109/ISSPA.2001.950250
M3 - Conference contribution
AN - SCOPUS:84904285024
SN - 0780367030
SN - 9780780367036
T3 - 6th International Symposium on Signal Processing and Its Applications, ISSPA 2001 - Proceedings; 6 Tutorials in Communications, Image Processing and Signal Analysis
SP - 723
EP - 726
BT - 6th International Symposium on Signal Processing and Its Applications, ISSPA 2001 - Proceedings; 6 Tutorials in Communications, Image Processing and Signal Analysis
PB - IEEE Computer Society
T2 - 6th International Symposium on Signal Processing and Its Applications, ISSPA 2001
Y2 - 13 August 2001 through 16 August 2001
ER -