Title :
A multi-task neural network approach to speech recognition
Author_Institution :
Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
Abstract :
Improving the speaker-independent generalization exhibited by neural network approaches to phoneme identification is an area of continuing interest to speech recognition researchers. The author reports research exploring the combined impact of multiple task constraints and differing speech input representations on network generalization. The multiple tasks required of the networks are based on a psychological model of speech perception. Using 12 American vowels to train and test the networks, the differing input representations are motivated by current theories of vowel perception and human audition. Network results compare favorably to baseline performance results established by a K-nearest neighbor classification and the classification performance of human listeners on the same task. These results are also extremely good when compared to performance reported by other researchers
Keywords :
constraint theory; generalisation (artificial intelligence); neural nets; speech recognition; American vowels; classification performance; multi-task neural network; multiple task constraints; network generalization; phoneme identification; psychological model; speaker-independent generalization; speech input representations; speech perception; speech recognition; Computer science; Encoding; Humans; Neural networks; Psychology; Speech recognition; Testing; Voting;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225884