DocumentCode :
735032
Title :
Low-frequency word enhancement with similar pairs in speech recognition
Author :
Xi Ma ; Xiaoxi Wang ; Dong Wang
Author_Institution :
Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
fYear :
2015
fDate :
12-15 July 2015
Firstpage :
343
Lastpage :
347
Abstract :
In practical automatic speech recognition (ASR) systems, it is difficult to recognize words that are with low-frequency in the language model (LM) training data. Ironically, these words tend to be highly important as they are often domain-specific name entities. In order to meet this challenge, we present a novel approach that enhances the weights of these words by borrowing information from some high-frequency words that are similar to the target words. Experimental results demonstrated that our method can significantly improve ASR performance on low-frequency words and does not impact performance on high-frequency words. Additionally, this method can be easily extended to deal with new words that are absent in the LM training data.
Keywords :
speech recognition; ASR system; LM training data; automatic speech recognition system; domain-specific name entity; language model; low-frequency word enhancement; Acoustics; Hidden Markov models; Probability; Speech; Speech recognition; Training; Training data; finite state transducer; language model; similar pair; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
Type :
conf
DOI :
10.1109/ChinaSIP.2015.7230421
Filename :
7230421
Link To Document :
بازگشت