DocumentCode :
591897
Title :
Using syntactic and confusion network structure for out-of-vocabulary word detection
Author :
Marin, A. ; Kwiatkowski, Tom ; Ostendorf, Mari ; Zettlemoyer, Luke
Author_Institution :
Univ. of Washington, Seattle, WA, USA
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
159
Lastpage :
164
Abstract :
This paper addresses the problem of detecting words that are out-of-vocabulary (OOV) for a speech recognition system to improve automatic speech translation. The detection system leverages confidence prediction techniques given a confusion network representation and parsing with OOV word tokens to identify spans associated with true OOV words. Working in a resource-constrained domain, we achieve OOV detection F-scores of 60-66 and reduce word error rate by 12% relative to the case where OOV words are not detected.
Keywords :
speech recognition; vocabulary; OOV; automatic speech translation; confusion network structure; out-of-vocabulary word detection; resource constrained domain; speech recognition system; syntactic network structure; Error analysis; Grammar; Lattices; Speech; Speech recognition; Syntactics; Vocabulary; OOV detection; parsing; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424215
Filename :
6424215
Link To Document :
بازگشت