Title :
Use of a self-learning neuro-fuzzy system for syllabic labeling of continuous speech
Author :
Hsieh, Ching-Tang ; Su, Mu-Chun ; Chienn, Shih-Chieh
Author_Institution :
Dept. of Electr. Eng., Tamkang Univ., Tamsui, Taiwan
Abstract :
For reducing the requirement of large memory and minimizing computation complexity in a large-vocabulary continuous speech recognition system, speech segmentation plays an important role. In this paper, the authors formulate the speech segmentation as a two-phase problem. Phase 1 (frame labelling) involves labeling frames of speech data. Frames are classified into three types: (1) silence; (2) consonants; and (3) vowels according to two segmentation features. In phase 2 (syllabic unit segmentation) the authors apply the concept of transition states to segment continuous speech data into syllabic units based on the labeled frames. The novel class of hyperrectangular composite neural networks (HRCNs) is used to cluster frames. The HRCNNs integrate the rule-based approach and neural network paradigms, therefore, this special hybrid system may neutralize the disadvantages of each alternative. The parameters in the trained HRCNNs are utilized to extract both crisp and fuzzy classification rules. Four speakers´ continuous reading-rate Mandarin speech are given to illustrate the proposed two-phase speech segmentation model. In the authors´ experiments, the performance of the HRCNNs is better than the “distributed fuzzy rule” approach based on the comparisons of the number of rules and the correct recognition rate
Keywords :
fuzzy logic; neural nets; pattern classification; speech recognition; unsupervised learning; classification rules; consonants; continuous reading-rate Mandarin speech; continuous speech; frame labelling; hyperrectangular composite neural networks; large-vocabulary continuous speech recognition system; rule-based approach; self-learning neuro-fuzzy system; silence; speech segmentation; syllabic labeling; syllabic unit segmentation; transition states; vowels; Backpropagation; Fuzzy neural networks; Fuzzy systems; Genetic algorithms; Input variables; Labeling; Neural networks; Power generation; Prototypes; Speech recognition;
Conference_Titel :
Fuzzy Systems, 1995. International Joint Conference of the Fourth IEEE International Conference on Fuzzy Systems and The Second International Fuzzy Engineering Symposium., Proceedings of 1995 IEEE Int
Conference_Location :
Yokohama
Print_ISBN :
0-7803-2461-7
DOI :
10.1109/FUZZY.1995.409915