DocumentCode :
3520804
Title :
Phonetically-based vector excitation coding of speech at 3.6 kbps
Author :
Wang, Shihua ; Gersho, Allen
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
fYear :
1989
fDate :
23-26 May 1989
Firstpage :
49
Abstract :
A phonetically based segmentation of speech is performed to classify segments into five classes: onset, unvoiced, low-pass voiced, steady-state voiced, and transient voiced. The segment lengths are constrained to an integer multiple of a unit frame. For each segment class, a distinctive coding scheme based on vector excitation coding (VXC) is used. The maximum bit rate is 3.6 kb/s, and a moderate coding delay of 45 ms is incurred. Performance is roughly comparable to conventional VXC/CELP (code-excited linear prediction) coding at 4.8 kb/s.
Keywords :
encoding; filtering and prediction theory; speech analysis and processing; 3.6 kbit/s; VXC/CELP; low-pass voiced; onset; phonetically based segmentation; speech coding; steady-state voiced; transient voiced; unvoiced; vector excitation coding; Degradation; Delay; Distortion measurement; Filters; Linear predictive coding; Speech coding; Speech synthesis; Steady-state; Vectors; Vocoders;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1989.266360
Filename :
266360