• DocumentCode
    1264358
  • Title

    A novel objective function for improved phoneme recognition using time-delay neural networks

  • Author

    Hampshire, John B., II ; Waibel, Alexander H.

  • Author_Institution
    Carnegie-Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • Issue
    2
  • fYear
    1990
  • fDate
    6/1/1990 12:00:00 AM
  • Firstpage
    216
  • Lastpage
    228
  • Abstract
    Single-speaker and multispeaker recognition results are presented for the voice-stop consonants /b,d,g/ using time-delay neural networks (TDNNs) with a number of enhancements, including a new objective function for training these networks. The new objective function, called the classification figure of merit (CFM), differs markedly from the traditional mean-squared-error (MSE) objective function and the related cross entropy (CE) objective function. Where the MSE and CE objective functions seek to minimize the difference between each output node and its ideal activation, the CFM function seeks to maximize the difference between the output activation of the node representing incorrect classifications. A simple arbitration mechanism is used with all three objective functions to achieve a median 30% reduction in the number of misclassifications when compared to TDNNs trained with the traditional MSE back-propagation objective function alone
  • Keywords
    delays; neural nets; speech recognition; classification figure of merit; cross entropy; mean-squared-error; objective function; phoneme recognition; time-delay neural networks; voice-stop consonants; Computer science; Costs; Entropy; Error analysis; Laboratories; Neural networks; Speech recognition; Statistics;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/72.80233
  • Filename
    80233