DocumentCode
1264358
Title
A novel objective function for improved phoneme recognition using time-delay neural networks
Author
Hampshire, John B., II ; Waibel, Alexander H.
Author_Institution
Carnegie-Mellon Univ., Pittsburgh, PA, USA
Volume
1
Issue
2
fYear
1990
fDate
6/1/1990 12:00:00 AM
Firstpage
216
Lastpage
228
Abstract
Single-speaker and multispeaker recognition results are presented for the voice-stop consonants /b,d,g/ using time-delay neural networks (TDNNs) with a number of enhancements, including a new objective function for training these networks. The new objective function, called the classification figure of merit (CFM), differs markedly from the traditional mean-squared-error (MSE) objective function and the related cross entropy (CE) objective function. Where the MSE and CE objective functions seek to minimize the difference between each output node and its ideal activation, the CFM function seeks to maximize the difference between the output activation of the node representing incorrect classifications. A simple arbitration mechanism is used with all three objective functions to achieve a median 30% reduction in the number of misclassifications when compared to TDNNs trained with the traditional MSE back-propagation objective function alone
Keywords
delays; neural nets; speech recognition; classification figure of merit; cross entropy; mean-squared-error; objective function; phoneme recognition; time-delay neural networks; voice-stop consonants; Computer science; Costs; Entropy; Error analysis; Laboratories; Neural networks; Speech recognition; Statistics;
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/72.80233
Filename
80233
Link To Document