Title :
Application of the modulation model to speech recognition
Author :
Fineberg, Adam B. ; Mammone, Richard J. ; Flanagan, James L.
Author_Institution :
Center for Comput. Aids for Ind. Productivity, Rutgers Univ., Piscataway, NJ, USA
Abstract :
A novel method of characterizing speech signals is presented in which the signal is modeled in terms of the mean radian frequency, the amplitude modulating function and the frequency modulating function of each frequency domain component (i.e., formants). This model defines the component structure and represents the local characteristics of the component and its bandwidth. A vowel classification task is presented where a distance metric of the relative modulation functions is used. This technique has been found to be robust to inter- and intraspeaker variability
Keywords :
amplitude modulation; frequency modulation; speech analysis and processing; speech recognition; amplitude modulating function; bandwidth; classification task; component structure; distance metric; formants; frequency domain component; frequency modulating function; interspeaker variability; intraspeaker variability; local characteristics; mean radian frequency; modulation model; speech recognition; speech signals; vowel recognition; Amplitude modulation; Bandwidth; Frequency modulation; Hidden Markov models; Phase modulation; Robustness; Signal analysis; Spectrogram; Speech analysis; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225851