Title :
Training machine classifiers to match the performance of human listeners in a natural vowel classification task
Author :
Hunke, Martin ; Holton, Thorn
Author_Institution :
Sch. of Eng., San Francisco State Univ., CA, USA
Abstract :
The purpose of this research is to determine how models of human auditory physiology can improve the performance of automatic speech recognition systems. In this study, a series of experiments was undertaken to discover how humans categorize and confuse vowels in natural speech. The recognition task comprised a large number of vowel nuclei isolated from sentences spoken naturally by a large number of talkers. Machine vowel classifiers were trained to match the results of these vowel categorization experiments using two input feature representations: a spectral-energy representation and a representation derived from an auditory model. Classifiers trained on the auditory-model representation match human performance and are more robust to noise and spectral filtering than classifiers trained on the spectral-energy representation.
Keywords :
feature extraction; noise; pattern classification; performance evaluation; physiological models; spectral analysis; speech recognition; auditory model; automatic speech recognition systems; human auditory physiology; human listeners; input feature representations; machine classifier training; natural speech; natural vowel classification task; naturally spoken sentences; performance; research; spectral filtering; spectral-energy feature representation; vowel categorization experiments; Automatic speech recognition; Computer displays; Humans; Matched filters; Natural languages; Noise robustness; Physiology; Qualifications; Recruitment; Speech recognition
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607182