DocumentCode :
3520969
Title :
Shift-invariant, multi-category phoneme recognition using Kohonen´s LVQ2
Author :
McDermott, Erik ; Katagiri, Shigeru
Author_Institution :
ATR Auditory & Visual Perception Res. Lab., Osaka, Japan
fYear :
1989
fDate :
23-26 May 1989
Firstpage :
81
Abstract :
The authors describe a shift-tolerant neural network architecture for phoneme recognition. The system is based on LVQ2, an algorithm which pays close attention to approximating the optimal Bayes decision line in a discrimination task. Recognition performances in the 98-99% correct range were obtained for LVQ2 networks aimed at speaker-dependent recognition of phonemes in small but ambiguous Japanese phonemic classes. A correct recognition rate of 97.7% was achieved by a single, larger LVQ2 network covering all Japanese consonants. These recognition results are at least as high as those obtained in the time delay neural network system and suggest that LVQ2 could be the basis for a successful speech recognition system
Keywords :
neural nets; speech recognition; Japanese consonants; Kohonen´s LVQ2; learning vector quantisation; optimal Bayes decision line; phoneme recognition; shift-tolerant neural network; speech recognition; Delay effects; Laboratories; Machine learning; Neural networks; Poles and towers; Speech recognition; Vector quantization; Visual perception; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1989.266368
Filename :
266368
Link To Document :
بازگشت