• DocumentCode
    53993
  • Title

    Memory-efficient buffering method and enhanced reference template for embedded automatic speech recognition system

  • Author

    Chih-Hung Chou ; Ta-Wen Kuan ; Po-Chuan Lin ; Bo-Wei Chen ; Jhing-Fa Wang

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    9
  • Issue
    3
  • fYear
    2015
  • fDate
    5 2015
  • Firstpage
    153
  • Lastpage
    164
  • Abstract
    This work realises a memory-efficient embedded automatic speech recognition (ASR) system on a resource-constrained platform. A buffering method called ultra-low queue-accumulator buffering is presented to efficiently use the constrained memory to extract the linear prediction cepstral coefficient (LPCC) feature in the embedded ASR system. The optimal order of the LPCC is evaluated to balance the recognition accuracy and the computational cost. In the decoding part, the proposed enhanced cross-words reference templates (CWRTs) method is incorporated into the template matching method to reach the speaker-independent characteristic of ASR tasks without the large memory burden of the conventional CWRTs method. The proposed techniques are implemented on a 16-bit microprocessor GPCE063A platform with a 49.152 MHz clock, using a sampling rate of 8 kHz. Experimental results demonstrate that recognition accuracy reaches 95.22% in a 30-sentence speaker-independent embedded ASR task, using only 0.75 kB RAM.
  • Keywords
    buffer storage; decoding; feature extraction; microprocessor chips; speaker recognition; speech coding; 30-sentence speaker-independent embedded ASR task; CWRTs method; LPCC feature extraction; RAM; constrained memory; decoding part; embedded ASR system; enhanced cross-word reference template method; enhanced reference template; frequency 49.152 MHz; frequency 8 kHz; linear prediction cepstral coefficient feature extraction; memory-efficient buffering method; memory-efficient embedded automatic speech recognition system; microprocessor GPCE063A platform; resource-constrained platform; speaker-independent characteristic; template matching method; ultra-low queue-accumulator buffering; word length 16 bit;
  • fLanguage
    English
  • Journal_Title
    Computers & Digital Techniques, IET
  • Publisher
    iet
  • ISSN
    1751-8601
  • Type

    jour

  • DOI
    10.1049/iet-cdt.2014.0008
  • Filename
    7101896