DocumentCode
735032
Title
Low-frequency word enhancement with similar pairs in speech recognition
Author
Xi Ma ; Xiaoxi Wang ; Dong Wang
Author_Institution
Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
fYear
2015
fDate
12-15 July 2015
Firstpage
343
Lastpage
347
Abstract
In practical automatic speech recognition (ASR) systems, it is difficult to recognize words that are with low-frequency in the language model (LM) training data. Ironically, these words tend to be highly important as they are often domain-specific name entities. In order to meet this challenge, we present a novel approach that enhances the weights of these words by borrowing information from some high-frequency words that are similar to the target words. Experimental results demonstrated that our method can significantly improve ASR performance on low-frequency words and does not impact performance on high-frequency words. Additionally, this method can be easily extended to deal with new words that are absent in the LM training data.
Keywords
speech recognition; ASR system; LM training data; automatic speech recognition system; domain-specific name entity; language model; low-frequency word enhancement; Acoustics; Hidden Markov models; Probability; Speech; Speech recognition; Training; Training data; finite state transducer; language model; similar pair; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location
Chengdu
Type
conf
DOI
10.1109/ChinaSIP.2015.7230421
Filename
7230421
Link To Document