مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker-independent spoken digits recognition using LVQ

DocumentCode :

2444453

Title :

Speaker-independent spoken digits recognition using LVQ

Author :

Kondo, K. ; Kamata, H. ; Ishida, Y.

Author_Institution :

Dept. of Electron. & Commun., Meiji Univ., Kawasaki, Japan

Volume :

fYear :

1994

fDate :

27 Jun-2 Jul 1994

Firstpage :

4448

Abstract :

Presents a spoken Japanese digits recognition system using LVQ (learning vector quantization). LVQ is very effective for phoneme recognition and its algorithm is very simple. The authors try to utilize the LVQ algorithm using a word, not a phoneme, as one unit. Input vectors in the authors´ system are the mel-cepstrum coefficients generated from beginning points to end points of spoken digits. In the recognition process the authors only find the closest reference vector to the input vector. Experiments are executed for two cases. One is for some isolated spoken digits. The other is for some continuous spoken digits (the speech speed, V, is 1<V<3 [word/sec]). The recognition rate of isolated spoken digits was 99.2%. That of continuous spoken digits was 95.4%. Experimental results show this method is effective for spoken digits recognition

Keywords :

cepstral analysis; learning (artificial intelligence); neural nets; speech processing; speech recognition; vector quantisation; LVQ; continuous spoken digits; isolated spoken digits; learning vector quantization; mel-cepstrum coefficients; speaker-independent spoken digits recognition; spoken Japanese digits recognition system; Cepstral analysis; Cepstrum; Euclidean distance; Speech analysis; Speech recognition; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on

Conference_Location :

Orlando, FL

Print_ISBN :

0-7803-1901-X

Type :

conf

DOI :

10.1109/ICNN.1994.374986

Filename :

374986

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2444453