DocumentCode
2296031
Title
A low cost dynamic vocabulary speech recognizer on a GPP-DSP system
Author
Kao, Yu-Hung ; Rajasekaran, P.K.
Author_Institution
Texas Instrum., USA
Volume
6
fYear
2000
fDate
2000
Firstpage
3215
Abstract
Continuous speech recognition is a resource-intensive algorithm. Commercial dictation software requires more than 10 Mbytes to install on the disk and 32 Mbytes RAM to run the application. Because of the resource requirement, such a system can not be implemented in a low cost and low power embedded system. We propose a design of dynamic vocabulary speech recognizer that will fit in a DSP-GPP (general purpose processor) architecture. The computation intensive, small footprint recognizer engine runs on the DSP; and the computation non-intensive, larger footprint grammar, dictionary, and model acoustic components resides on the GPP. The recognition models are prepared on the GPP and transferred to the DSP, the interaction among the application, model generation, and recognition modules is minimal. The result is a speech recognition server implemented in a low cost embedded system. The application can dynamically create flexible vocabulary to suit different recognition contexts. It still does not do large vocabulary dictation; however, it provides unlimited recognition contexts with unlimited vocabulary, all these implementable in a low cost embedded system
Keywords
digital signal processing chips; embedded systems; general purpose computers; speech recognition; GPP-DSP system; RAM; application module; commercial dictation software; continuous speech recognition; dictionary; general purpose processor; grammar; low cost dynamic vocabulary speech recognizer; low cost embedded system; model acoustic components; model generation; recognition contexts; recognition models; recognition module; small footprint recognizer engine; speech recognition server; Application software; Computer architecture; Costs; Dictionaries; Digital signal processing; Embedded system; Engines; Speech processing; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.860084
Filename
860084
Link To Document