Title :
Time-frequency representation based cepstral processing for speech recognition
Author :
Fineberg, Adam B. ; Yu, Kevin C.
Author_Institution :
Lexicus Div., Motorola Inc., Palo Alto, CA, USA
Abstract :
Both linear predictive coding (LPC) and mel scale frequency cepstral coefficient (MFCC) analysis, the most common techniques for speech recognition signal processing, make the assumption that the speech signal is stationary for some analysis window and produce a representation based upon the “stationary” frequency content within the window. This work uses a technique based upon Cohen´s (1989) class of generalized time frequency representations (TFR) to produce selected frequency representations that are not based upon an assumption of stationarity. This representation is used in a speech recognition system to produce improved accuracy. The proposed approach requires a kernel design to specify the attributes of the representations. The considerations used for analyzing speech signals and the resulting attributes are discussed. Comparisons with standard analysis techniques are presented. The significant computational requirements are also discussed
Keywords :
cepstral analysis; linear predictive coding; signal representation; speech coding; speech processing; speech recognition; time-frequency analysis; Cohen´s class; LPC analysis; MFCC analysis; analysis techniques; analysis window; cepstral processing; computational requirements; frequency representations; generalized time-frequency representation; kernel design; linear predictive coding; mel scale frequency cepstral coefficient; signal processing; speech recognition system; speech signal; stationary frequency content; Cepstral analysis; Linear predictive coding; Mel frequency cepstral coefficient; Signal analysis; Signal processing; Speech analysis; Speech coding; Speech processing; Speech recognition; Time frequency analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.540281