• DocumentCode
    1102383
  • Title

    Comparative study of nonlinear time warping techniques in isolated word speech recognition systems

  • Author

    Waibel, A. ; Yegnanarayana, B.

  • Author_Institution
    Carnegie-Mellon University, Pittsburgh, PA, USA
  • Volume
    31
  • Issue
    6
  • fYear
    1983
  • fDate
    12/1/1983 12:00:00 AM
  • Firstpage
    1582
  • Lastpage
    1586
  • Abstract
    In this paper, the effects of two major design choices on the performance of an isolated word speech recognition system are examined in detail. They are: 1) the choice of a warping algorithm among the Itakura asymmetric, the Sakoe and Chiba symmetric, and the Sakoe and Chiba asymmetric, and 2) the size of the warping window to reduce computation time. Two vocabularies were used: the digits (zero, one,..., nine) and a highly confusable subset of the alphabet (b, c, d, e, g, p, t, v, z). The Itakura asymmetric warping algorithm appears to be slightly better than the other two for the confusable vocabulary. We discuss the reasons why the performance of the algorithms is vocabulary dependent. Finally, for the data used in our experiments, a warping window of about 100 ms appears to be optimal.
  • Keywords
    Acoustic signal processing; Algorithm design and analysis; Computer science; Databases; Signal processing algorithms; Space technology; Speech processing; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1983.1164241
  • Filename
    1164241