Use of word level side information to improve speech recognition

Author

Vergyri, Dimitra

Author_Institution

Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA

Volume

3

fYear

2000

fDate

2000

Firstpage

1823

Abstract

Word level information obtained from the output of a speech recognizer has been used in the past to extract confidence features for the hypothesized words. This work describes a post-recognition process which treats these word-level features as independent knowledge sources and combines them in one log linear model for the posterior probability of a word sequence. This model is used for rescoring the hypotheses. The parameters of the model are optimized using a discriminative model combination approach, where a simplex optimization method, known as amoeba search, is used to minimize the non-smooth function of empirical error rate on training data. The method is evaluated on the SWITCHBOARD database. After training 20 new parameters, we obtain a significant word error rate reduction over the baseline system. A correlation measure between features and word accuracy is defined to help analyze and explain the results

Keywords

feature extraction; optimisation; search problems; speech recognition; SWITCHBOARD database; amoeba search; confidence features extraction; correlation measure; discriminative model combination approach; empirical error rate; hypotheses rescoring; independent knowledge sources; log linear model; model parameters; non-smooth function minimization; post-recognition process; posterior probability; simplex optimization method; speech recognition; training data; word accuracy; word error rate reduction; word level side information; word sequence; word-level features; Adaptation model; Data mining; Error analysis; Feature extraction; Natural languages; Optimization methods; Spatial databases; Speech processing; Speech recognition; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location

Istanbul

ISSN

1520-6149

Print_ISBN

0-7803-6293-4

Type

conf

DOI

10.1109/ICASSP.2000.862109

Filename

862109