Title :
ASR error detection using recurrent neural network language model and complementary ASR
Author :
Yik-Cheung Tam ; Yun Lei ; Jing Zheng ; Wen Wang
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
Abstract :
Detecting automatic speech recognition (ASR) errors can play an important role for effective human-computer spoken dialogue system, as recognition errors can hinder accurate system understanding of user intents. Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the users. We propose two approaches to improve ASR error detection: (1) using recurrent neural network language models to capture long-distance word context within and across previous utterances; (2) using a complementary ASR system. The intuition is that when two complementary ASR systems disagree on a region in an utterance, this region is most likely an error. We train a neural network predictor of errors using a variety of features. We performed experiments on both English and Iraqi Arabic ASR and observed significant improvement in error detection using the proposed methods.
Keywords :
error detection; recurrent neural nets; speech recognition; ASR error detection; English ASR; Iraqi Arabic ASR; automatic speech recognition error detection; complementary ASR system; dialogue manager; human-computer spoken dialogue system; long-distance word context; neural network predictor; recurrent neural network language model; Acoustics; Conferences; Feature extraction; Neural networks; Speech; Speech processing; Speech recognition; ASR error detection; complementary ASR; deep neural network acoustic model; recurrent neural network language model;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854012