Title :
Fuzzy partition models and their effect in continuous speech recognition
Author :
Kato, Y. ; Sugiyama, M.
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
fDate :
31 Aug-2 Sep 1992
Abstract :
Fuzzy partition models (FPMs) with multiple input-output units were applied to continuous speech recognition, and the use of automatic incremental training was evaluated. After initial training using word data, phrase recognition rates of 72.7% and 66.9% were obtained for an FPM and a TDNN (time-delay neural network), respectively. After incremental training, the phrase recognition rates improved to 86.3% and 78.4%, respectively. The FPMs provided more accurate segmentation after incremental training. The experiments determined that better phoneme segmentation provides greater improvement in phrase recognition. Incremental training also significantly improves recognition performance. As FPMs can be trained rapidly, various applications using large-scale training data are also possible
Keywords :
fuzzy logic; learning (artificial intelligence); neural nets; speech recognition; automatic incremental training; continuous speech recognition; fuzzy partition models; multiple input-output units; phoneme segmentation; phrase recognition rates; time-delay neural network; word data; Automatic speech recognition; Feedforward systems; Fuzzy neural networks; Laboratories; Neural networks; Partitioning algorithms; Speech recognition; Telephony; Viterbi algorithm;
Conference_Titel :
Neural Networks for Signal Processing [1992] II., Proceedings of the 1992 IEEE-SP Workshop
Conference_Location :
Helsingoer
Print_ISBN :
0-7803-0557-4
DOI :
10.1109/NNSP.1992.253702