Title :
On the application of spectrum target prediction model to speech recognition
Author :
Akagi, Masato ; Tohkura, Yoh´ichi
Author_Institution :
ATR Auditory & Visual Perception Res. Lab., Osaka, Japan
Abstract :
A preprocessing method is proposed for automatic speech recognition that uses a spectrum target prediction model to cope with coarticulation, one of the most serious problems in automatic speech recognition. The method is evaluated by three measures: spectral stability with respect to measuring predicted spectrum variation, and intracategory variation. Experimental results indicate that predicted spectra throughout the model are stabilized in each phoneme portion by eliminating variations of original spectra without prediction. The results also indicate that by using the preprocessing method, intracategory variation decreases and intercategory variation increases. Consequently, the spectrum target prediction model implemented as a speech-recognition preprocessor improves automatic speech recognition performance
Keywords :
filtering and prediction theory; spectral analysis; speech recognition; automatic speech recognition; coarticulation; intracategory variation; phoneme portion; predicted spectrum variation; preprocessing method; spectral stability; spectrum target prediction model; Automatic speech recognition; Damping; Databases; Humans; Parameter estimation; Predictive models; Speech analysis; Speech recognition; Target recognition; Visual perception;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
DOI :
10.1109/ICASSP.1988.196531