DocumentCode :
2516499
Title :
Improved Mandarin Keyword Spotting Using Confusion Garbage Model
Author :
Zhang, Shilei ; Shuang, Zhiwei ; Shi, Qin ; Qin, Yong
Author_Institution :
IBM Res.-China, Beijing, China
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
3700
Lastpage :
3703
Abstract :
This paper presents an improved acoustic keyword spotting (KWS) algorithm using a novel confusion garbage model in Mandarin conversational speech. Observing the KWS corpus, we found there are many words with similar pronunciation with predefined keywords, although they have different Chinese characters and different meanings, which easily result in high false alarm rate. In this paper, an improved acoustic KWS method with confusion garbage models was developed that absorbs similar pronunciation words confused with specific keywords for a given task. One obvious advantage of such method is that it provides a flexible framework to implement the selection procedure and reduce false alarm rate effectively for a specific task. The efficiency of the proposed architecture was evaluated under HMM-based confidence measures (CM) methods and demonstrated on a conversational telephone dataset.
Keywords :
hidden Markov models; natural language processing; speech recognition; Chinese characters; HMM-based CM method; HMM-based confidence measure method; Mandarin conversational speech; Mandarin keyword spotting; acoustic KWS method; acoustic keyword spotting; confusion garbage model; conversational telephone dataset; false alarm rate; hidden Markov models; similar pronunciation words; speech recognition; Acoustics; Computational modeling; Decoding; Hidden Markov models; Speech; Speech recognition; Vocabulary; confidence measure; confusion garbage model; keyword spotting; similar pronunciation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.901
Filename :
5597890
Link To Document :
بازگشت