DocumentCode :
2787570
Title :
Novel CI-backoff scheme for real-time embedded speech recognition
Author :
Ma, Tao ; Deisher, Michael
Author_Institution :
Mississippi State Univ., Starkville, MS, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
1614
Lastpage :
1617
Abstract :
A new method for reduction of computation and memory bandwidth for embedded large vocabulary continuous speech recognition is presented. During the Hidden Markov model state likelihood computation, scores for selected context-dependent (triphone) model states are computed for several frames in advance. Scores that are subsequently needed for Viterbi search but not found in the buffer are replaced by the scores for associated context independent (monophone) models. On the Wall Street Journal 20,000 word continuous speech recognition task, an overall reduction of 58% memory bandwidth and decrease of 23% execution time is achieved relative to an assembly optimized implementation of Sphinx 3. Recognition accuracy is reduced by <;1% while recognition latency is increased by 30 milliseconds.
Keywords :
embedded systems; hidden Markov models; maximum likelihood estimation; speech recognition; vocabulary; CI-backoff scheme; Hidden Markov model; Viterbi search; context independent models; context-dependent model; real-time embedded system; vocabulary continuous speech recognition; Bandwidth; Context modeling; Decoding; Delay; Embedded computing; Hidden Markov models; Load modeling; Probability density function; Speech recognition; Vocabulary; HMM; LVCSR; acoustic modeling; backoff; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494887
Filename :
5494887
Link To Document :
بازگشت