DocumentCode
178316
Title
ASR error detection using recurrent neural network language model and complementary ASR
Author
Yik-Cheung Tam ; Yun Lei ; Jing Zheng ; Wen Wang
Author_Institution
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
2312
Lastpage
2316
Abstract
Detecting automatic speech recognition (ASR) errors can play an important role for effective human-computer spoken dialogue system, as recognition errors can hinder accurate system understanding of user intents. Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the users. We propose two approaches to improve ASR error detection: (1) using recurrent neural network language models to capture long-distance word context within and across previous utterances; (2) using a complementary ASR system. The intuition is that when two complementary ASR systems disagree on a region in an utterance, this region is most likely an error. We train a neural network predictor of errors using a variety of features. We performed experiments on both English and Iraqi Arabic ASR and observed significant improvement in error detection using the proposed methods.
Keywords
error detection; recurrent neural nets; speech recognition; ASR error detection; English ASR; Iraqi Arabic ASR; automatic speech recognition error detection; complementary ASR system; dialogue manager; human-computer spoken dialogue system; long-distance word context; neural network predictor; recurrent neural network language model; Acoustics; Conferences; Feature extraction; Neural networks; Speech; Speech processing; Speech recognition; ASR error detection; complementary ASR; deep neural network acoustic model; recurrent neural network language model;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854012
Filename
6854012
Link To Document