DocumentCode
77637
Title
A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models
Author
Price, Michael ; Glass, James ; Chandrakasan, Anantha P.
Author_Institution
Electr. Eng. & Comput. Sci. Dept., Massachusetts Inst. of Technol., Cambridge, MA, USA
Volume
50
Issue
1
fYear
2015
fDate
Jan. 2015
Firstpage
102
Lastpage
112
Abstract
We describe an IC that provides a local speech recognition capability for a variety of electronic devices. We start with a generic speech decoder architecture that is programmable with industry-standard WFST and GMM speech models. Algorithm and architectural enhancements are incorporated in order to achieve real-time performance amid system-level constraints on internal memory size and external memory bandwidth. A 2.5 × 2.5 mm test chip implementing this architecture was fabricated using a 65 nm process. The chip performs a 5,000 word recognition task in real-time with 13.0% word error rate, 6.0 mW core power consumption, and a search efficiency of approximately 16 nJ per hypothesis.
Keywords
Gaussian processes; speech recognition; GMM speech models; WFST models; external memory bandwidth; generic speech decoder architecture; internal memory size; local speech recognition capability; real-time speech recognizer; Bandwidth; Decoding; Hidden Markov models; Random access memory; Real-time systems; Speech; Speech recognition; CMOS digital integrated circuits; Gaussian mixture models (GMM); low-power electronics; speech recognition; weighted finite-state transducers (WFST);
fLanguage
English
Journal_Title
Solid-State Circuits, IEEE Journal of
Publisher
ieee
ISSN
0018-9200
Type
jour
DOI
10.1109/JSSC.2014.2367818
Filename
6975250
Link To Document