DocumentCode
1783883
Title
Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information
Author
Xuyang Wang ; Ta Li ; Pengyuan Zhang ; Jielin Pan ; Yonghong Yan
Author_Institution
Key Lab. of Speech Acoust. & Content Understanding, Beijing, China
fYear
2014
fDate
27-29 Aug. 2014
Firstpage
594
Lastpage
597
Abstract
The detection of Out-of-vocabulary (OOV) words is a crucial problem for spoken term detection (STD). In this paper, the use of integration with local acoustic information is investigated to retrieve more OOV words. Tokens with high local acoustic probabilities propagated in the search space at the decoding stage will be forced to propagate to the next frame. In this way, acoustic similar words can be reserved in recognition results without considering of language model probabilities. Experimental results show that this new approach results in a significant increase in the performance of OOV words detection. At least a relative improvement of 8.5% in equal error rate is achieved over the baseline system. Meanwhile, it will do no harm to in-vocabulary (IV) words detection. With some refinement of beam pruning, the decoding time only rises 3% relative to the baseline system.
Keywords
acoustic signal processing; probability; speech recognition; vocabulary; word processing; OOV word detection; STD; baseline system; beam pruning; decoding time; enhanced out-of-vocabulary word detection; equal error rate; in-vocabulary word detection; language model probabilities; local acoustic information; local acoustic probabilities; spoken term detection; Acoustic beams; Acoustics; Decoding; Hidden Markov models; Signal processing; Speech; Speech recognition; OOV; spoken term detection; token passing;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
Conference_Location
Kitakyushu
Print_ISBN
978-1-4799-5389-9
Type
conf
DOI
10.1109/IIH-MSP.2014.154
Filename
6998399
Link To Document