• DocumentCode
    83799
  • Title

    Optimization of Weighted Finite State Transducer for Speech Recognition

  • Author

    Aubert, L. ; Woods, Roger ; Fischaber, Scott ; Veitch, R.

  • Author_Institution
    Applic. Solutions (Electron. & Vision) Ltd., Redhill, UK
  • Volume
    62
  • Issue
    8
  • fYear
    2013
  • fDate
    Aug. 2013
  • Firstpage
    1607
  • Lastpage
    1615
  • Abstract
    There is considerable interest in creating embedded, speech recognition hardware using the weighted finite state transducer (WFST) technique but there are performance and memory usage challenges. Two system optimization techniques are presented to address this; one approach improves token propagation by removing the WFST epsilon input arcs; another one-pass, adaptive pruning algorithm gives a dramatic reduction in active nodes to be computed. Results for memory and bandwidth are given for a 5,000 word vocabulary giving a better practical performance than conventional WFST; this is then exploited in an adaptive pruning algorithm that reduces the active nodes from 30,000 down to 4,000 with only a 2 percent sacrifice in speech recognition accuracy; these optimizations lead to a more simplified design with deterministic performance.
  • Keywords
    DRAM chips; SRAM chips; finite state machines; optimisation; speech recognition; storage management; transducers; DRAM memory solutions; SRAM memory solutions; WFST epsilon input arcs; adaptive pruning algorithm; memory usage challenges; one-pass pruning algorithm; performance challenges; speech recognition hardware; system optimization techniques; token propagation; weighted finite state transducer optimization; Acoustics; Bandwidth; Decoding; Hidden Markov models; Loading; Speech; Speech recognition; Embedded processors; WFST; memory organization; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.51
  • Filename
    6475937