Title :
Speaker-independent spoken digits recognition using LVQ
Author :
Kondo, K. ; Kamata, H. ; Ishida, Y.
Author_Institution :
Dept. of Electron. & Commun., Meiji Univ., Kawasaki, Japan
fDate :
27 Jun-2 Jul 1994
Abstract :
Presents a spoken Japanese digits recognition system using LVQ (learning vector quantization). LVQ is very effective for phoneme recognition and its algorithm is very simple. The authors try to utilize the LVQ algorithm using a word, not a phoneme, as one unit. Input vectors in the authors´ system are the mel-cepstrum coefficients generated from beginning points to end points of spoken digits. In the recognition process the authors only find the closest reference vector to the input vector. Experiments are executed for two cases. One is for some isolated spoken digits. The other is for some continuous spoken digits (the speech speed, V, is 1<V<3 [word/sec]). The recognition rate of isolated spoken digits was 99.2%. That of continuous spoken digits was 95.4%. Experimental results show this method is effective for spoken digits recognition
Keywords :
cepstral analysis; learning (artificial intelligence); neural nets; speech processing; speech recognition; vector quantisation; LVQ; continuous spoken digits; isolated spoken digits; learning vector quantization; mel-cepstrum coefficients; speaker-independent spoken digits recognition; spoken Japanese digits recognition system; Cepstral analysis; Cepstrum; Euclidean distance; Speech analysis; Speech recognition; Vector quantization;
Conference_Titel :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
DOI :
10.1109/ICNN.1994.374986