• DocumentCode
    233258
  • Title

    Development of Small Footprint Korean Large Vocabulary Speech Recognition for Commanding a Standalone Robot

  • Author

    Donghyun Lee ; Minkyu Lim ; Myoung-Wan Koo ; Jungyun Seo ; Gil-Jin Jang ; Ji-Hwan Kim ; Jeong-Sik Park

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
  • fYear
    2014
  • fDate
    8-10 Nov. 2014
  • Firstpage
    536
  • Lastpage
    540
  • Abstract
    The work in this paper concerns a small footprint Acoustic Model (AM) and its use in the implementation of a Large Vocabulary Isolated Speech Recognition (LVISR) system for commanding a robot in the Korean language, which requires about 500KB of memory. Tree-based state clustering was applied to reduce the number of total unique states, while preserving its original performance. A decision tree induction method was developed for the tree-based state clustering. For this method, a binary question set, measurement function and stopping criterion were devised. A phoneme set consisting of 38 phonemes was defined for the implementation of small footprint Korean LVISR. Further reduction in memory requirement was achieved through integer arithmetic operation. The best multiplication factor was determined for this operation. As a result, we successfully developed a small footprint Korean LVISR that requires memory space about 500KB.
  • Keywords
    decision trees; human-robot interaction; natural language processing; pattern clustering; speech recognition; storage management; AM; Korean language; binary question set; decision tree induction method; integer arithmetic operation; large vocabulary isolated speech recognition system; measurement function; memory requirement reduction; memory space; multiplication factor; phoneme set; small footprint Korean LVISR; small footprint Korean large vocabulary speech recognition; small footprint acoustic model; standalone robot; stopping criterion; tree-based state clustering; Acoustics; Hidden Markov models; Memory management; Robots; Speech; Speech recognition; Vocabulary; Korean large vocabulary speech recognition; small footprint; standalone robot;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Broadband and Wireless Computing, Communication and Applications (BWCCA), 2014 Ninth International Conference on
  • Conference_Location
    Guangdong
  • Print_ISBN
    978-1-4799-4174-2
  • Type

    conf

  • DOI
    10.1109/BWCCA.2014.112
  • Filename
    7016130