DocumentCode :
302075
Title :
Time-frequency representation based cepstral processing for speech recognition
Author :
Fineberg, Adam B. ; Yu, Kevin C.
Author_Institution :
Lexicus Div., Motorola Inc., Palo Alto, CA, USA
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
25
Abstract :
Both linear predictive coding (LPC) and mel scale frequency cepstral coefficient (MFCC) analysis, the most common techniques for speech recognition signal processing, make the assumption that the speech signal is stationary for some analysis window and produce a representation based upon the "stationary" frequency content within the window. This work uses a technique based upon Cohen's (1989) class of generalized time-frequency representations (TFR) to produce selected frequency representations that are not based upon an assumption of stationarity. This representation is used in a speech recognition system to produce improved accuracy. The proposed approach requires a kernel design to specify the attributes of the representations. The considerations used for analyzing speech signals and the resulting attributes are discussed. Comparisons with standard analysis techniques are presented. The significant computational requirements are also discussed.
Keywords :
cepstral analysis; linear predictive coding; signal representation; speech coding; speech processing; speech recognition; time-frequency analysis; Cohen's class; LPC analysis; MFCC analysis; analysis techniques; analysis window; cepstral processing; computational requirements; frequency representations; generalized time-frequency representation; kernel design; linear predictive coding; mel scale frequency cepstral coefficient; signal processing; speech recognition system; speech signal; stationary frequency content; Cepstral analysis; Linear predictive coding; Mel frequency cepstral coefficient; Signal analysis; Signal processing; Speech analysis; Speech coding; Speech processing; Speech recognition; Time frequency analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540281
Filename :
540281