DocumentCode :
2595014
Title :
Hardware Implementation of Real-Time Speech Recognition System Using TMS320C6713 DSP
Author :
Manikandan, J. ; Venkataramani, B. ; Girish, K. ; Karthic, H. ; Siddharth, V.
Author_Institution :
Dept. of Electron. & Commun. Eng. (ECE), Nat. Inst. of Technol., Trichy (NITT), Trichy, India
fYear :
2011
fDate :
2-7 Jan. 2011
Firstpage :
250
Lastpage :
255
Abstract :
Continuous, real-time speech recognition is required for various mobile and hands-free applications. In this paper, hardware implementation of real-time speech recognition system is proposed using two approaches and their performances are evaluated. The first approach uses Mel Filter Banks with Mel Frequency Cepstrum Coefficients (MFCC) as feature input and the second approach uses Cochlear Filter Banks with Zero-crossings (ZC) as feature input for recognition. The features extracted from input speech are fed to multi-class Support Vector Machine (SVM) classifier for recognition. The proposed recognition systems are implemented on a Texas Instruments TMS320C6713 floating point digital signal processor for recognizing isolated digits (0-9) and their performances are compared. It is observed that the program memory required for MFCC feature extraction is 44.42% higher than that required for feature extraction using Cochlear filters. Recognition accuracies of 93.33% and 98.67% are achieved for feature inputs from Mel filter banks and Cochlear filter banks respectively. It is also observed that the computational complexity of feature extraction using cochlear filters is 1.53 times of that required for MFCC feature extraction. The recognition performance is also studied for different combinations of test and training utterances. It is found that training using 15 utterances of each digit results in best recognition accuracy. The techniques proposed here can be adapted for various other hands-free consumer applications such as washing machines, hands-free cordless and many more.
Keywords :
channel bank filters; computational complexity; digital signal processing chips; feature extraction; real-time systems; speech recognition; support vector machines; MFCC feature extraction; Mel filter banks; Mel frequency cepstrum coefficients; TMS320C6713 DSP; TMS320C6713 floating point digital signal processor; cochlear filter banks; computational complexity; hands-free applications; hardware implementation; mobile applications; multiclass support vector machine; real-time speech recognition system; zero-crossings; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Real time systems; Speech recognition; Support vector machines; Cochlear Filter Bank; DSP; Feature Extraction; Hands-free applications; MFCC; Mel Filter Bank; Real-time Speech Recognition; Zero crossings;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
VLSI Design (VLSI Design), 2011 24th International Conference on
Conference_Location :
Chennai
ISSN :
1063-9667
Print_ISBN :
978-1-61284-327-8
Electronic_ISBN :
1063-9667
Type :
conf
DOI :
10.1109/VLSID.2011.12
Filename :
5718810
Link To Document :
بازگشت