DocumentCode :
3022471
Title :
Bandwidth expansion of speech based on vector quantization of the mel frequency cepstral coefficients
Author :
Enbom, Niklas ; Kleijn, W. Bastiaan
Author_Institution :
Dept. of Speech, Music & Hearing, R. Inst. of Technol., Stockholm, Sweden
fYear :
1999
fDate :
1999
Firstpage :
171
Lastpage :
173
Abstract :
Telephone speech is usually limited to less than 4 kHz in bandwidth. This bandwidth limitation results in the typical sound of telephone speech. We present a new method of regenerating the high frequencies (4-8 kHz) based on vector quantization of the mel-frequency cepstral coefficients (MFCC). We also present two methods to avoid perceptually annoying overestimates of the signal power in the high band. Listening tests show the benefits of the new procedures. Using MFCC for vector quantization instead of the traditionally used spectral representations improves the quality of the speech significantly. Tests also show that the wideband speech reconstructed with the method is significantly more pleasant to the human ear than the original narrowband speech.
Keywords :
cepstral analysis; speech coding; speech enhancement; telephony; vector quantisation; 4 to 8 kHz; bandwidth expansion; listening tests; mel frequency cepstral coefficients; speech quality improvement; telephone speech; vector quantization; wideband speech reconstruction; Bandwidth; Cepstral analysis; Ear; Humans; Mel frequency cepstral coefficient; Speech; Telephony; Testing; Vector quantization; Wideband;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Speech Coding Proceedings, 1999 IEEE Workshop on
Conference_Location :
Porvoo
Print_ISBN :
0-7803-5651-9
Type :
conf
DOI :
10.1109/SCFT.1999.781521
Filename :
781521