• DocumentCode
    3234189
  • Title

    A hybrid neural network, dynamic programming word spotter

  • Author

    Zeppenfeld, Torsten ; Waibel, Alex H.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    2
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    77
  • Abstract
    A novel keyword-spotting system that combines both neural network and dynamic programming techniques is presented. This system makes use of the strengths of time delay neural networks (TDNNs), which include strong generalization ability, potential for parallel implementations, robustness to noise, and time shift invariant learning. Dynamic programming models are used by this system because they have the useful capability of time warping input speech patterns. This system was trained and tested on the Stonehenge Road Rally database, which is a 20-keyword-vocabulary, speaker-independent, continuous-speech corpus. Currently, this system performs at a figure of merit (FOM) rate of 82.5%. FOM is the detection rate averaged from 0 to 10 false alarms per keyword hour. This measure is explained in detail
  • Keywords
    dynamic programming; neural nets; speech recognition; Stonehenge Road Rally database; TDNN; continuous-speech corpus; detection rate; dynamic programming word spotter; figure of merit; input speech patterns; speaker independent speech recognition; time delay neural networks; time shift invariant learning; time warping; vocabulary; Computer science; Dictionaries; Distributed databases; Dynamic programming; Neural networks; Noise robustness; Speech enhancement; Speech recognition; System testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.226116
  • Filename
    226116