DocumentCode :
2705259
Title :
An Evaluation of Lattice Scoring using a Smoothed Estimate of Word Accuracy
Author :
Omar, Mohamed K. ; Mangu, Lidia
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper describes a novel approach for estimating the best hypothesis of a given word lattice, the hypothesis lattice, using another word lattice, the reference lattice, and its application to large vocabulary automatic speech recognition. This approach selects the word sequence in the hypothesis lattice which maximizes a smoothed estimate of the word accuracy with respect to the reference lattice. It is shown in the paper that two algorithms similar to the Viterbi and the forward-backward algorithms can be used to estimate the hypothesis which approximately maximizes this objective function. We present in this paper two setups to test the performance of our approach. In the first setup, only one lattice is used as both the reference and the hypothesis lattices. In the second setup, two lattices produced by different systems are used to calculate the best hypothesis. In each setup, we test our approach on two Arabic broadcast news speech recognition tasks. Compared to the baseline results, up to 2.1% relative improvement in the word error rate (WER) is obtained by using our approach.
Keywords :
error statistics; natural language processing; speech recognition; Arabic broadcast news speech recognition tasks; Viterbi algorithm; forward-backward algorithms; given word lattice; hypothesis lattice; large vocabulary automatic speech recognition; lattice scoring; objective function; reference lattice; smoothed estimation; word accuracy; word error rate; word sequence; Automatic speech recognition; Broadcasting; Decoding; Error analysis; Lattices; Minimization methods; Speech recognition; Testing; Viterbi algorithm; Vocabulary; ASR decoding; Lattice scoring; confusion network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367278
Filename :
4218309
Link To Document :
بازگشت