A low cost dynamic vocabulary speech recognizer on a GPP-DSP system

Author

Kao, Yu-Hung ; Rajasekaran, P.K.

Author_Institution

Texas Instruments, USA

Volume

6

fYear

2000

fDate

2000

Firstpage

3215

Abstract

Continuous speech recognition is resource-intensive. Commercial dictation software requires more than 10 Mbytes of disk space to install and 32 Mbytes of RAM to run. Because of these resource requirements, such a system cannot be implemented in a low-cost, low-power embedded system. We propose a design for a dynamic-vocabulary speech recognizer that fits a DSP-GPP (general-purpose processor) architecture. The computation-intensive, small-footprint recognizer engine runs on the DSP, while the less computation-intensive, larger-footprint grammar, dictionary, and acoustic model components reside on the GPP. The recognition models are prepared on the GPP and transferred to the DSP; the interaction among the application, model generation, and recognition modules is minimal. The result is a speech recognition server implemented in a low-cost embedded system. The application can dynamically create flexible vocabularies to suit different recognition contexts. The system still does not perform large-vocabulary dictation; however, it provides unlimited recognition contexts with unlimited vocabulary, all implementable in a low-cost embedded system.
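
The division of labor described in the abstract (models prepared on the GPP, then transferred to the DSP) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the phone inventory, the two-word dictionary, the byte layout, and the function names `build_context` and `load_context` are all hypothetical, and the "DSP side" is simply simulated in the same process.

```python
# Hypothetical phone inventory and pronunciation dictionary (illustrative only).
PHONES = ["sil", "k", "ao", "l", "s", "t", "aa", "p"]
DICTIONARY = {
    "call": ["k", "ao", "l"],
    "stop": ["s", "t", "aa", "p"],
}


def build_context(words):
    """GPP side: pack the active vocabulary into a compact byte blob.

    Assumed layout: one byte for the word count, then for each word one byte
    for the phone count followed by that many one-byte phone indices.
    """
    blob = bytearray([len(words)])
    for word in words:
        phones = DICTIONARY[word]
        blob.append(len(phones))
        blob.extend(PHONES.index(p) for p in phones)
    return bytes(blob)


def load_context(blob):
    """DSP side (simulated): decode the blob into phone-index sequences."""
    count, pos, models = blob[0], 1, []
    for _ in range(count):
        n = blob[pos]
        models.append(list(blob[pos + 1 : pos + 1 + n]))
        pos += 1 + n
    return models


# The application picks a vocabulary for the current recognition context;
# only the small packed blob would cross the GPP-DSP boundary.
blob = build_context(["call", "stop"])
models = load_context(blob)
```

In this sketch the larger resources (dictionary and phone inventory) stay on the preparing side, and switching recognition context amounts to calling `build_context` with a different word list.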

Keywords

digital signal processing chips; embedded systems; general purpose computers; speech recognition; GPP-DSP system; RAM; application module; commercial dictation software; continuous speech recognition; dictionary; general purpose processor; grammar; low cost dynamic vocabulary speech recognizer; low cost embedded system; model acoustic components; model generation; recognition contexts; recognition models; recognition module; small footprint recognizer engine; speech recognition server; Application software; Computer architecture; Costs; Dictionaries; Digital signal processing; Embedded system; Engines; Speech processing; Speech recognition; Vocabulary;

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location

Istanbul

ISSN

1520-6149

Print_ISBN

0-7803-6293-4

Type

conf

DOI

10.1109/ICASSP.2000.860084

Filename

860084