DocumentCode :
179051
Title :
Strategies for Vietnamese keyword search
Author :
Chen, Nancy F. ; Sivadas, Sunil ; Boon Pang Lim ; Hoang Gia Ngo ; Haihua Xu ; Van Tung Pham ; Bin Ma ; Haizhou Li
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
4121
Lastpage :
4125
Abstract :
We propose strategies for a state-of-the-art Vietnamese keyword search (KWS) system developed at the Institute for Infocomm Research (I2R). The KWS system exploits acoustic features characterizing the creaky voice quality peculiar to lexical tones in Vietnamese, a minimal-resource transliteration framework to alleviate out-of-vocabulary issues from foreign loan words, and a proposed system combination scheme, FusionX. We show that the proposed creaky voice quality features complement pitch-related features, reaching fusion gains of 17.7% relative (6.9% absolute). To the best of our knowledge, the proposed transliteration framework is the first reported rule-based system for Vietnamese; it outperforms statistical-approach baselines by 14.93% to 36.73% relative on foreign loan word search tasks. Using FusionX to combine three sub-systems, the actual term-weighted value (ATWV) reaches 0.4742, exceeding the ATWV=0.3 benchmark for IARPA Babel participants in the NIST OpenKWS13 Evaluation.
Keywords :
information retrieval; knowledge based systems; natural language processing; sensor fusion; speech recognition; ATWV; FusionX scheme; Institute for Infocomm Research; KWS system; NIST OpenKWS13 Evaluation; Vietnamese keyword search; acoustic features; actual term-weighted value; creaky voice quality feature; fusion gain; lexical tones; minimal-resource transliteration framework; pitch-related feature; rule-based system; Acoustics; Hidden Markov models; Keyword search; Lattices; Speech; Speech recognition; Training; audio indexing; deep neural networks (DNN); glottalization; large vocabulary continuous speech recognition (LVCSR); low-resourced languages; spoken term detection
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854377
Filename :
6854377