DocumentCode :
3528553
Title :
Efficient combination of likelihood recycling and batch calculation based on conditional fast processing and acoustic back-off
Author :
Ogawa, Atsunori ; Takahashi, Satoshi ; Nakamura, Atsushi
Author_Institution :
NTT Commun. Sci. Labs., Kyoto
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4161
Lastpage :
4164
Abstract :
This paper proposes an efficient combination of state likelihood recycling and batch state likelihood calculation for accelerating acoustic likelihood calculation in an HMM-based speech recognizer. Recycling and batch calculation are each based on different technical approaches, i.e. the former is a purely algorithmic technique while the latter fully exploits PC architecture, and their good acceleration performances are reported in the literatures, respectively. To accelerate the recognition process further by combining them efficiently, we introduce conditional fast processing and acoustic back-off strategies. Our combination algorithm employs the conditional fast processing strategy that is conditioned by two criteria. The first potential activity criterion is used to control not only the recycling of state likelihoods at the current frame but also the precalculation of state likelihoods for several succeeding frames. The second reliability criterion and acoustic back-off are used to control the choice of recycled or batch calculated state likelihoods when they are contradictory in the combination and to prevent word accuracies from degrading. Large vocabulary spontaneous speech recognition experiments using four PCs with different specifications showed that, despite the PC specification dependence, the combined acceleration technique further reduced the total recognition time on all of the PCs.
Keywords :
acoustic signal processing; hidden Markov models; speech recognition; acoustic back-off; batch state likelihood calculation; likelihood recycling; recognition process; reliability criterion; state likelihood recycling; state likelihoods; Acceleration; Context modeling; Decoding; Degradation; Hidden Markov models; Laboratories; Personal communication networks; Recycling; Speech recognition; Vocabulary; acoustic back-off; batch state likelihood calculation; combined acceleration technique; fast acoustic likelihood calculation; state likelihood recycling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960545
Filename :
4960545
Link To Document :
بازگشت