Approximate Test Risk Bound Minimization Through Soft Margin Estimation

Author

Li, Jinyu ; Yuan, Ming ; Lee, Chin-Hui

Author_Institution

Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA

Volume

15

Issue

8

fYear

2007

Firstpage

2393

Lastpage

2404

Abstract

Inspired by the great success of margin-based classifiers, there is a trend to incorporate the margin concept into hidden Markov modeling for speech recognition. Several attempts based on margin maximization were proposed recently. In this paper, a new discriminative learning framework, called soft margin estimation (SME), is proposed for estimating the parameters of continuous-density hidden Markov models. The proposed method makes direct use of the successful ideas of soft margin in support vector machines to improve generalization capability and decision feedback learning in minimum classification error training to enhance model separation in classifier design. SME is illustrated from a perspective of statistical learning theory. By including a margin in formulating the SME objective function, SME is capable of directly minimizing an approximate test risk bound. Frame selection, utterance selection, and discriminative separation are unified into a single objective function that can be optimized using the generalized probabilistic descent algorithm. Tested on the TIDIGITS connected digit recognition task, the proposed SME approach achieves a string accuracy of 99.43%. On the 5 k-word Wall Street Journal task, SME obtains relative word error rate reductions of about 10% over our best baseline results in different experimental configurations. We believe this is the first attempt to show the effectiveness of margin-based acoustic modeling for large vocabulary continuous speech recognition in a hidden Markov model framework. Further improvements are expected because the approximate test risk bound minimization principle offers a flexible and rigorous framework to facilitate incorporation of new margin-based optimization criteria into hidden Markov model training.

Keywords

feedback; generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); minimisation; parameter estimation; signal classification; speech recognition; support vector machines; approximate test risk bound minimization; continuous-density hidden Markov models; decision feedback learning; generalization capability; minimum classification error training; optimization; parameter estimation; soft margin estimation; speech recognition; support vector machines; Error analysis; Feedback; Hidden Markov models; Machine learning; Parameter estimation; Speech recognition; Statistical learning; Support vector machine classification; Support vector machines; Testing; Discriminative training (DT); soft margin estimation (SME); statistical learning; test risk;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2007.906178

Filename

4317566