Title :
Speaker dependent and independent speech recognition experiments with an auditory model
Author :
Hunt, Melvyn J. ; Lefèbvre, Claude
Author_Institution :
Nat. Res. Council of Canada, Ottawa, Ont., Canada
Abstract :
The performance of an auditory model has been compared with that of a conventional filterbank mel-cepstrum representation in speaker-dependent and speaker-independent spoken digit recognition tests. The model produces two outputs: one sensitive to voicing and onsets, and the other sensitive to formant structure and showing two-tone suppression. Linear discriminant analysis has been used to combine the outputs into eight coefficients. Undegraded, noisy and spectrally tilted male speech was tested with a quasi-isolated-word system. A subset of the tests were repeated with a connected-word system, and with undegraded female speech. In all cases the model performed better than the conventional representation. With degraded speech the differences were large
Keywords :
speech recognition; auditory model; connected-word system; degraded speech; filterbank mel-cepstrum representation; formant structure; linear discriminant analysis; noisy speech; onsets; quasi-isolated-word system; speaker-dependent; speaker-independent; spectrally tilted male speech; speech recognition experiments; spoken digit recognition tests; two-tone suppression; undegraded female speech; voicing; Councils; Detectors; Filter bank; Linear discriminant analysis; Power harmonic filters; Psychoacoustic models; Resonance; Spectrogram; Speech recognition; System testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
DOI :
10.1109/ICASSP.1988.196552