DocumentCode
591897
Title
Using syntactic and confusion network structure for out-of-vocabulary word detection
Author
Marin, A. ; Kwiatkowski, Tom ; Ostendorf, Mari ; Zettlemoyer, Luke
Author_Institution
Univ. of Washington, Seattle, WA, USA
fYear
2012
fDate
2-5 Dec. 2012
Firstpage
159
Lastpage
164
Abstract
This paper addresses the problem of detecting words that are out-of-vocabulary (OOV) for a speech recognition system to improve automatic speech translation. The detection system leverages confidence prediction techniques given a confusion network representation and parsing with OOV word tokens to identify spans associated with true OOV words. Working in a resource-constrained domain, we achieve OOV detection F-scores of 60-66 and reduce word error rate by 12% relative to the case where OOV words are not detected.
Keywords
speech recognition; vocabulary; OOV; automatic speech translation; confusion network structure; out-of-vocabulary word detection; resource constrained domain; speech recognition system; syntactic network structure; Error analysis; Grammar; Lattices; Speech; Speech recognition; Syntactics; Vocabulary; OOV detection; parsing; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location
Miami, FL
Print_ISBN
978-1-4673-5125-6
Electronic_ISBN
978-1-4673-5124-9
Type
conf
DOI
10.1109/SLT.2012.6424215
Filename
6424215
Link To Document