Title :
Error type classification and word accuracy estimation using alignment features from word confusion network
Author :
Ogawa, Atsunori ; Hori, Takaaki ; Nakamura, Atsushi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
This paper addresses error type classification in continuous speech recognition (CSR). In CSR, errors are classified into three types, namely, the substitution, insertion and deletion errors, by making an alignment between a recognized word sequence and its reference transcription with a dynamic programming (DP) procedure. We propose a method for deriving such alignment features from a word confusion network (WCN) without using the reference transcription. We show experimentally that the WCN-based alignment features steadily improve the performance of error type classification. They also improve the performance of out-of-vocabulary (OOV) word detection, since OOV word utterances are highly correlated with a particular alignment pattern. In addition, we show that the word accuracy can be estimated from the WCN-based alignment features and more accurately from the error type classification result without using the reference transcription.
Keywords :
dynamic programming; speech recognition; vocabulary; CSR; DP procedure; OOV word detection; WCN-based alignment features; continuous speech recognition; deletion errors; dynamic programming procedure; error type classification; insertion errors; out-of-vocabulary word detection; recognized word sequence; reference transcription; word accuracy estimation; word confusion network; Abstracts; Accuracy; Speech recognition; alignment features; error type classification; word accuracy estimation; word confusion network;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289024