Title :
System and keyword dependent fusion for spoken term detection
Author :
Van Tung Pham ; Chen, Nancy F. ; Sivadas, Sunil ; Haihua Xu ; I-Fan Chen ; Chongjia Ni ; Eng Siong Chng ; Haizhou Li
Author_Institution :
Nanyang Technol. Univ., Singapore, Singapore
Abstract :
System combination (or data fusion1) is known to provide significant improvement for spoken term detection (STD). The key issue of the system combination is how to effectively fuse the various scores of participant systems. Currently, most system combination methods are system and keyword independent, i.e. they use the same arithmetic functions to combine scores for all keywords. Although such strategy improve keyword search performance, the improvement is limited. In this paper we first propose an arithmetic-based system combination method to incorporate the system and keyword characteristics into the fusion procedure to enhance the effectiveness of system combination. The method incorporates a system-keyword dependent property, which is the number of acceptances in this paper, into the combination procedure. We then introduce a discriminative model to combine various useful system and keyword characteristics into a general framework. Improvements over standard baselines are observed on the Vietnamese data from IARPA Babel program with the NIST OpenKWS13 Evaluation setup.
Keywords :
sensor fusion; speech recognition; IARPA Babel program; NIST OpenKWS13 Evaluation setup; STD improvement; Vietnamese data; arithmetic-based system combination method; data fusion; discriminative model; keyword characteristics; keyword dependent fusion; keyword search performance improvement; spoken term detection improvement; system characteristics; system dependent fusion; system score fusion; system-keyword dependent property; Feature extraction; Hidden Markov models; Keyword search; NIST; Reliability; Speech; Training; discriminative modeling; keyword search; spoken term detection (STD); system combination;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078613