DocumentCode :
3230119
Title :
A multi-task neural network approach to speech recognition
Author :
Richards, E.L.
Author_Institution :
Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
413
Abstract :
Improving the speaker-independent generalization exhibited by neural network approaches to phoneme identification is an area of continuing interest to speech recognition researchers. The author reports research exploring the combined impact of multiple task constraints and differing speech input representations on network generalization. The multiple tasks required of the networks are based on a psychological model of speech perception. Using 12 American vowels to train and test the networks, the differing input representations are motivated by current theories of vowel perception and human audition. Network results compare favorably to baseline performance results established by a K-nearest neighbor classification and the classification performance of human listeners on the same task. These results are also extremely good when compared to performance reported by other researchers
Keywords :
constraint theory; generalisation (artificial intelligence); neural nets; speech recognition; American vowels; classification performance; multi-task neural network; multiple task constraints; network generalization; phoneme identification; psychological model; speaker-independent generalization; speech input representations; speech perception; speech recognition; Computer science; Encoding; Humans; Neural networks; Psychology; Speech recognition; Testing; Voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225884
Filename :
225884
Link To Document :
بازگشت