DocumentCode
3230119
Title
A multi-task neural network approach to speech recognition
Author
Richards, E.L.
Author_Institution
Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
413
Abstract
Improving the speaker-independent generalization exhibited by neural network approaches to phoneme identification is an area of continuing interest to speech recognition researchers. The author reports research exploring the combined impact of multiple task constraints and differing speech input representations on network generalization. The multiple tasks required of the networks are based on a psychological model of speech perception. Using 12 American vowels to train and test the networks, the differing input representations are motivated by current theories of vowel perception and human audition. Network results compare favorably to baseline performance results established by a K -nearest neighbor classification and the classification performance of human listeners on the same task. These results are also extremely good when compared to performance reported by other researchers
Keywords
constraint theory; generalisation (artificial intelligence); neural nets; speech recognition; American vowels; classification performance; multi-task neural network; multiple task constraints; network generalization; phoneme identification; psychological model; speaker-independent generalization; speech input representations; speech perception; speech recognition; Computer science; Encoding; Humans; Neural networks; Psychology; Speech recognition; Testing; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225884
Filename
225884
Link To Document