Title :
Continuous speech recognition based on high plausibility regions
Author :
Gong, Yifan ; Haton, Jean-Paul ; Mouria, Feriel
Author_Institution :
CRI INRIA-Lorraine, Vandoeuvre, France
Abstract :
The authors propose an approach to phoneme-based continuous speech recognition for the case where a time function of the plausibility of observing each phoneme (the spotting result) is given. They introduce a criterion for the best sentence, based on the sum of the plausibilities of the individual symbols composing the sentence. By exploiting high plausibility regions to reduce the computational load while maintaining optimality, the method finds the most plausible sentences for the input speech. Two optimization procedures are defined to handle the following embedded search processes: (1) finding the best path connecting peaks of the plausibility functions of two successive symbols, and (2) finding the best time transition slot index for two given peaks. Experimental results show that the method gives better recognition precision while requiring about 1/20 of the computing time of traditional DP-based methods. The experimental system obtained a 95% sentence recognition rate on a multispeaker test.
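Search process (2) above can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so the following is an assumption rather than the authors' procedure: each symbol's plausibility is taken to be a list of per-slot scores, the segment for the first symbol runs from its peak up to the transition slot, the segment for the second symbol runs from the transition slot to its peak, and the score is the plain sum of plausibilities over both segments. The function name `best_transition` and its parameters are hypothetical.

```python
from itertools import accumulate


def best_transition(p_a, p_b, peak_a, peak_b):
    """Pick the slot t in (peak_a, peak_b] at which to switch from symbol a
    to symbol b so that the summed plausibility is maximal.

    Assumption (not taken from the paper): symbol a covers slots
    peak_a..t-1 and symbol b covers slots t..peak_b.
    Returns (t, score).
    """
    if not (0 <= peak_a < peak_b < min(len(p_a), len(p_b))):
        raise ValueError("need 0 <= peak_a < peak_b within both sequences")

    # Prefix sums let each candidate transition be scored in O(1).
    cum_a = [0.0] + list(accumulate(p_a))
    cum_b = [0.0] + list(accumulate(p_b))

    best_t, best_score = None, float("-inf")
    for t in range(peak_a + 1, peak_b + 1):
        score_a = cum_a[t] - cum_a[peak_a]       # slots peak_a..t-1
        score_b = cum_b[peak_b + 1] - cum_b[t]   # slots t..peak_b
        if score_a + score_b > best_score:
            best_t, best_score = t, score_a + score_b
    return best_t, best_score


# Toy plausibility functions for two successive symbols (made-up values).
p_a = [0.1, 0.9, 0.8, 0.2, 0.1, 0.0]
p_b = [0.0, 0.1, 0.2, 0.7, 0.9, 0.3]
t, score = best_transition(p_a, p_b, peak_a=1, peak_b=4)
```

Only the slots between the two peaks are examined, which reflects the idea of restricting the search to high plausibility regions instead of scanning the whole utterance.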
Keywords :
speech recognition; computational load; embedded search processes; high plausibility regions; most plausible sentences; multispeaker test; phoneme-based continuous speech recognition; time function; time transition slot index; two successive symbols; Artificial neural networks; Dynamic programming; Hidden Markov models; Impedance matching; Joining processes; Robustness; Speech recognition; System testing; Viterbi algorithm
Conference_Title :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150442