Title :
Integrating syllable boundary information into speech recognition
Author :
Wu, Su-Lin ; Shire, Michael L. ; Greenberg, Steven ; Morgan, Nelson
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Abstract :
We examine the proposition that knowledge of the timing of syllabic onsets may be useful in improving the performance of speech recognition systems. A method of estimating the location of syllable onsets derived from the analysis of energy trajectories in critical band channels has been developed, and a syllable-based decoder has been designed and implemented that incorporates this onset information into the speech recognition process. For a small, continuous speech recognition task the addition of artificial syllabic onset information (derived from advance knowledge of the word transcriptions) lowers the word error rate by 38%. Incorporating acoustically-derived syllabic onset information reduces the word error rate by 10% on the same task. The latter experiment has highlighted representational issues on coordinating acoustic and lexical syllabifications, a topic we are beginning to explore
Keywords :
acoustic signal processing; decoding; parameter estimation; speech processing; speech recognition; timing; acoustic syllabification; artificial syllabic onset information; automatic speech recognition systems; continuous speech recognition task; critical band channels; energy trajectories analysis; experiment; lexical syllabification; syllabic onsets timimg; syllable based decoder; syllable boundary information; syllable onsets location estimation; system performance; word error rate reduction; word transcriptions; Automatic speech recognition; Computer science; Decoding; Error analysis; Filters; Hidden Markov models; Psychology; Speech analysis; Speech processing; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596105