TY - GEN
T1 - New time-frequency vowel quantization enhanced by subband hierarchy
AU - Salam, Fraihat
AU - Hervé, Glotin
PY - 2008
Y1 - 2008
N2 - Speech dynamics may not well be addressed by the conventional speech processing. We analyse here a new quantization paradigm for vowel coding. It is based on simple Allen temporal interval algebra applied on subband voicing levels, yielding to a compressed speech representation of only 21 integers for a speech window up to 32 ms long. Experiments show that we take advantage of the ranking of the average values of the voicing interval accross the various subbands. Theses new features are evaluated for vowel recognition (1 hour, 6 vowels) on a referenced multispeaker radio broadcast news used during evaluation campaign ESTER. We work on the subset of the most frequent french vowels. We get 62% class error rate adding the ranking information to the Allen's relations, instead of 70% using Allen relations alone, and 57% the set of the raw 48 floats. We then discuss on the advantage of using more subbands, and we finaly propose a strategy to tackle the combinatorial complexity of Allen relations.
AB - Speech dynamics may not well be addressed by the conventional speech processing. We analyse here a new quantization paradigm for vowel coding. It is based on simple Allen temporal interval algebra applied on subband voicing levels, yielding to a compressed speech representation of only 21 integers for a speech window up to 32 ms long. Experiments show that we take advantage of the ranking of the average values of the voicing interval accross the various subbands. Theses new features are evaluated for vowel recognition (1 hour, 6 vowels) on a referenced multispeaker radio broadcast news used during evaluation campaign ESTER. We work on the subset of the most frequent french vowels. We get 62% class error rate adding the ranking information to the Allen's relations, instead of 70% using Allen relations alone, and 57% the set of the raw 48 floats. We then discuss on the advantage of using more subbands, and we finaly propose a strategy to tackle the combinatorial complexity of Allen relations.
KW - Allen temporal algebra
KW - Automatic speech recognition
KW - Quantization
KW - Speech analysis
KW - Time-frequency
UR - https://www.scopus.com/pages/publications/55849135859
M3 - Conference contribution
AN - SCOPUS:55849135859
SN - 9789898111609
T3 - SIGMAP 2008 - Proceedings of the International Conference on Signal Processing and Multimedia Applications
SP - 189
EP - 192
BT - SIGMAP 2008 - Proceedings of the International Conference on Signal Processing and Multimedia Applications
T2 - SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications
Y2 - 26 July 2008 through 29 July 2008
ER -