DocumentCode
1686994
Title
ASR error segment localization for spoken recovery strategy
Author
Bechet, Frederic ; Favre, Benoit
Author_Institution
LIF, Aix-Marseille Univ., Marseille, France
fYear
2013
Firstpage
6837
Lastpage
6841
Abstract
Even though small ASR errors might not impact downstream processes that make use of the transcript, larger error segments like those generated by OOVs can have a considerable impact on applications such as speech-to-speech translation and can eventually lead to communication failure between users of the system. This work focuses on error detection in ASR output targeted towards significant error segments that can be recovered using a dialog system. We propose a CRF system trained to recognize error segments with ASR confidence-based, lexical and syntactic features. The most significant error segment is passed to a dialog system for interactive recovery in which rephrased words are reinserted in the original. 22% of utterances can be fully recovered and an interesting by-product is that rewriting error segments as a single token reduces WER by 17% on an adverse corpus.
Keywords
speech recognition; ASR error segment localization; CRF system; automatic speech recognition; communication failure; dialog system; downstream process; error detection; interactive recovery; speech-to-speech translation; spoken recovery strategy; Computational linguistics; Error correction; Lattices; Measurement uncertainty; Speech; Speech recognition; Syntactics; Automatic Speech Recognition; Confidence Measure; Error Detection; Speech to Speech translation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6638986
Filename
6638986
Link To Document