DocumentCode :
3067847
Title :
Cepstral analysis synthesis on the mel frequency scale
Author :
Imai, Satoshi
Author_Institution :
Tokyo Institute of Technology, Yokohama, Japan
Volume :
8
fYear :
1983
fDate :
30407
Firstpage :
93
Lastpage :
96
Abstract :
This paper presents a new technique of cepstral analysis synthesis on the mel frequency scale. The log spectrum on the mel frequency scale (the mel log spectrum) is considered to be an effective representation of the spectral envelope of speech. This analysis synthesis system uses the mel log spectrum approximation (MLSA) filter, which was devised for cepstral synthesis on the mel frequency scale. The filter coefficients are easily obtained through a simple linear transform from the mel cepstrum, defined as the Fourier cosine coefficients of the mel log spectral envelope of speech. The MLSA filter has low coefficient sensitivity and good coefficient quantization characteristics. The spectral distortion caused by interpolation of the filter parameters of two successive frames is small. Accordingly, the data rate of this system is very low: speech of the same quality is synthesized at 60-70% of the data rate of the conventional cepstral vocoder or the LPC vocoder.
Keywords :
Cepstral analysis; Cepstrum; Fourier transforms; Frequency synthesizers; Mel frequency cepstral coefficient; Nonlinear filters; Quantization; Speech analysis; Speech synthesis; Vocoders;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
Type :
conf
DOI :
10.1109/ICASSP.1983.1172250
Filename :
1172250