Title :
Speech recognition using pulse coupled neural network
Author :
Chandrasekaran, Prabhakar ; Bodruzzaman, M. ; Yuen, G. ; Malkani, M.
Author_Institution :
Center of Neural Eng., Tennessee State Univ., Nashville, TN, USA
Abstract :
The same word uttered by different people have different waveforms, similarly the same word uttered by the same person at different time will also have some degree of variations in waveforms. In this experiment a set of ten words was selected and each word was uttered ten times by ten different speakers. The objective of this work is to extract the information conveyed in the signal, especially if the information is not directly recognizable as in the case of the speech signals. One powerful tool is the spectral analysis, where important properties of a signal are often very clearly depicted in the frequency domain, there are always situations where the spectral content of a signal is changing in time. In these situation, joint time-frequency methods, such as the spectrogram representations are used. The spectrogram image representation is then used as an input image to train a pulse coupled neural network (PCNN) which outputs a feature time series representing the number of neurons that fired each time the same image was presented. The usable length of this time-series feature (icon) is about 50 data points which is very small as compared to 512×512 data points to represent an image. The features are found to be very robust and show high promise for speech and speaker recognition
Keywords :
frequency-domain analysis; neural nets; spectral analysis; speech recognition; PCNN; feature time series; frequency-domain depiction; joint time-frequency methods; pulse coupled neural network; speaker recognition; spectral analysis; spectrogram representations; speech recognition; speech signals; Data mining; Frequency domain analysis; Image representation; Neural networks; Neurons; Robustness; Spectral analysis; Spectrogram; Speech recognition; Time frequency analysis;
Conference_Titel :
System Theory, 1998. Proceedings of the Thirtieth Southeastern Symposium on
Conference_Location :
Morgantown, WV
Print_ISBN :
0-7803-4547-9
DOI :
10.1109/SSST.1998.660127