Title :
Modulation features for speech recognition
Author :
Dimitriadis, Dimitrios ; Maragos, Petros ; Potamianos, Alexandros
Author_Institution :
Dept. ECE, National Technical University of Athens, Zografou, 15773, Greece
Abstract :
Automatic speech recognition (ASR) systems can benefit from adding to their acoustic processing stage new features that account for various nonlinear and time-varying phenomena in speech production. In this paper, we develop robust methods to extract novel modulation-type acoustic features from speech signals, based on time-varying models for speech analysis. Further, we combine the new speech features with the standard linear ones (mel-frequency cepstrum) into an augmented set of acoustic features and demonstrate its efficacy by showing significant improvements in HMM-based word recognition on the TIMIT database.
Keywords :
Cepstrum; Frequency modulation; Information filters; Smoothing methods
Conference_Title :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743733