DocumentCode :
2320974
Title :
Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
Author :
Chazan, Dan ; Hoory, Ron ; Cohen, Gilad ; Zibulski, Meir
Author_Institution :
IBM Res. Lab., Haifa, Israel
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1299
Abstract :
This paper presents a novel low complexity, frequency domain algorithm for reconstruction of speech from the mel-frequency cepstral coefficients (MFCC), commonly used by speech recognition systems, and the pitch frequency values. The reconstruction technique is based on the sinusoidal speech representation. A set of sine-wave frequencies is derived using the pitch frequency and voicing decisions, and synthetic phases are then assigned to each respective sine wave. The sine-wave amplitudes are generated by sampling a linear combination of frequency domain basis functions. The basis function gains are determined such that the mel-frequency binned spectrum of the reconstructed speech is similar to the mel-frequency binned spectrum, obtained from the original MFCC vector by IDCT and antilog operations. Natural sounding, good quality intelligible speech is obtained by this procedure
Keywords :
cepstral analysis; computational complexity; frequency-domain synthesis; speech intelligibility; speech recognition; speech synthesis; MFCC; frequency domain basis functions; intelligible speech; low complexity frequency domain algorithm; mel frequency cepstral coefficients; mel-frequency binned spectrum; pitch frequency; sampling; sine-wave amplitudes; sine-wave frequencies; sinusoidal speech representation; speech recognition; speech reconstruction; synthetic phases; voicing decisions; Bandwidth; Bit rate; Cepstral analysis; Feature extraction; Frequency domain analysis; Laboratories; Mel frequency cepstral coefficient; Speech recognition; Speech synthesis; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861816
Filename :
861816
Link To Document :
بازگشت