Title :
Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
Author :
Orillo, John William ; Yap, Roderick ; Sybingco, Edwin
Author_Institution :
De La Salle Univ., Manila, Philippines
Abstract :
In this study, spectral subtraction speech enhancement is integrated to a two word vocabulary speech recognition system to effectively reduce the effects of background noise and increase the recognition rate. The whole system was implemented in FPGA and was modelled in MATLAB. The preprocessing subsystem contains the spectral subtraction algorithm and acoustic front end speech enhancements while the speech recognition subsystem contains the HMM and Viterbi search algorithms. 10 dirty speech samples of word `stop´ and `clockwise´ (sampled at 84 dB) were tested in the speech recognition prototype with varying background noise from 44.6 to 85.4 dB and noise floor (β) from 0.01 to 1. At the end of the testing, the system was able to recognize the two words (stop and clockwise) efficiently with accuracy rate of above 80% until a background noise of 68.6 dB. The best average recognition rate (from 44.6 to 85.4 dB background noise) of 48.5% on the other hand was recorded at 0.01 noise floor. The system without spectral subtraction enhancement was noticed to function efficiently only at 56.6 dB.
Keywords :
field programmable gate arrays; hidden Markov models; search problems; speech enhancement; speech recognition; FPGA; HMM; MATLAB; Viterbi search; background noise reduction; minimum statistics algorithm; noise robust automatic speech recognition; spectral subtraction speech enhancement; two word vocabulary speech recognition; Field programmable gate arrays; Filter banks; Mel frequency cepstral coefficient; Noise; Speech; Speech enhancement; Speech recognition; Spectral Subtraction; Speech Recognition; Viterbi search;
Conference_Titel :
TENCON 2012 - 2012 IEEE Region 10 Conference
Conference_Location :
Cebu
Print_ISBN :
978-1-4673-4823-2
Electronic_ISBN :
2159-3442
DOI :
10.1109/TENCON.2012.6412318