Title :
Unsupervised learning the hidden structure of speech
Author :
Hambaba, Mohamed L. ; Charchali, Ali
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Stevens Inst. of Technol., Hoboken, NJ, USA
Abstract :
The unsupervised neural network learning procedure is applied to the analysis and recognition of speech. This procedure takes a set of input patterns and attempts to learn their function; it develops the necessary representational features during the course of learning. A series of computer simulation studies was carried out to assess the ability of these networks to label sounds accurately, to learn to recognize sounds without labels, and to learn feature representations of continuous speech. These studies demonstrate that the networks can learn to label presegmented test tokens with accuracies of up to 99%. These networks developed rich internal representation. There is no clock; the circuit is data driven, and there is no necessity for endpoint detection or segmentation of the speech signal during recognition. Training in the presence of noise provides noise immunity up to the trained level. For the speech problem studied, the circuit connection only need to be accurate to about a 3-b digitization depth for optimum performance. The algorithm used maps efficiently onto a simple VLSI hardware chip
Keywords :
learning systems; neural nets; speech recognition; VLSI; endpoint detection; machine learning; segmentation; sounds labelling; speech recognition; unsupervised neural network learning; Acoustic noise; Circuit noise; Circuit testing; Clocks; Computer simulation; Neural networks; Noise level; Speech analysis; Speech recognition; Unsupervised learning;
Conference_Titel :
System Theory, 1990., Twenty-Second Southeastern Symposium on
Conference_Location :
Cookeville, TN
Print_ISBN :
0-8186-2038-2
DOI :
10.1109/SSST.1990.138227