DocumentCode :
3167619
Title :
Detection of unseen words in conversational Mandarin
Author :
Bufyko, I. ; Kimball, Owen ; Siu, Man-Hung ; Herrero, José ; Blum, Dan
Author_Institution :
Raytheon BBN Technol., Cambridge, MA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
5181
Lastpage :
5184
Abstract :
We present a Mandarin keyword search system that uses a large vocabulary recognizer to generate consensus networks at various resolutions: word, character, syllable and phone. In order to achieve fast and accurate search, we propose the use of an efficient approximate-match dynamic programming algorithm that finds the best alignment between the target query and the consensus network. Experiments with Mandarin conversational telephone speech show that the approximate-match search improves detection accuracy by more than 10% for rare words that are not present in the recognizer´s dictionary (OOV terms). We also found OOV terms to benefit most from system combination, where we observe a roughly 10% improvement relative to the best single system.
Keywords :
dynamic programming; natural language processing; speech processing; Mandarin conversational telephone speech; OOV terms; approximate-match dynamic programming algorithm; consensus network; spoken term detection; target query; unseen words detection; Decision support systems; Mandarin; OOV; Spoken term detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289087
Filename :
6289087
Link To Document :
بازگشت