Title :
On design and implementation of an embedded automatic speech recognition system
Author :
Phadke, Sujay ; Limaye, Rhishikesh ; Verma, Siddharth ; Subramanian, Kavitha
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Mumbai, India
Abstract :
We present a new design of an Embedded Speech Recognition System. It combines the aspects of both hardware and software design to implement a speaker dependent, isolated word, small vocabulary speech recognition system. The feature extraction is based on modified Mel-scaled Frequency Cepstral Coefficients (MFCC) and template matching employs Dynamic Time Warping (DTW). A novel algorithm has been used to improve the detection of start of a word. The hardware is built around the industry standard TMS320LF2407A DSP. The board is designed to serve as a general purpose DSP development board for the 24X series of TI DSPs. It contains, apart from the DSP, the external SRAM, FLASH, ADC interface, I/O interfacing blocks and JTAG interface. Both the hardware and the software have been designed concurrently, with a view to achieve high-speed recognition with maximum accuracy in minimum power and making the device portable. The proposed solution is a low-cost, high-performance, scalable alternative to other existing products.
Keywords :
SRAM chips; analogue-digital conversion; cepstral analysis; digital signal processing chips; embedded systems; feature extraction; hardware-software codesign; speech recognition; ADC interface; DSP; FLASH; I/O interfacing blocks; JTAG interface; Mel scaled frequency cepstral coefficients; SRAM; digital signal processing; dynamic time warping; embedded automatic speech recognition system; feature extraction; hardware design; industry standard; random access memory; software design; speaker dependent system; speed recognition; static RAM; template matching; vocabulary speech recognition system; Automatic speech recognition; Cepstral analysis; Digital signal processing; Feature extraction; Hardware; Mel frequency cepstral coefficient; Random access memory; Software design; Speech recognition; Vocabulary;
Conference_Titel :
VLSI Design, 2004. Proceedings. 17th International Conference on
Print_ISBN :
0-7695-2072-3
DOI :
10.1109/ICVD.2004.1260914