Acoustic Modelling Using Continuous Rational Kernels

Author

Layton, M.I. ; Gales, M.J.F.

Author_Institution

Dept. of Eng., Cambridge Univ.

fYear

2005

fDate

28-28 Sept. 2005

Firstpage

165

Lastpage

170

Abstract

There has been significant interest in developing alternatives to hidden Markov models (HMMs) for speech recognition. In particular, interest has been focused upon models that allow additional dependencies to be incorporated. One such model is the augmented statistical model. Here a local exponential approximation, based upon derivatives of a base distribution, is made about some distribution of the base model. Augmented statistical models can be trained using a maximum margin criterion, which may be implemented using an SVM with a generative kernel. Calculating derivatives of the base distribution, in particular higher-order derivatives, to form the generative kernel requires complex dynamic programming algorithms. In this paper a new form of rational kernel, a continuous rational kernel is proposed. This allows elements of the generative kernel, including those based on higher-order derivatives, to be computed using standard forms of transducer within a rational kernel framework. In addition, the derivatives are shown to be a principled method of defining marginalised kernels. Continuous rational kernels are evaluated using a large vocabulary continuous speech recognition (LVCSR) task

Keywords

acoustic signal detection; dynamic programming; hidden Markov models; rational functions; speech recognition; statistical analysis; support vector machines; acoustic modelling; augmented statistical model; continuous rational kernels; dynamic programming; generative kernel; hidden Markov models; higher-order derivative; large vocabulary continuous speech recognition; local exponential approximation; support vector machines; Biological system modeling; Dynamic programming; Heuristic algorithms; Hidden Markov models; Kernel; Maximum likelihood estimation; Sequences; Speech recognition; Support vector machines; Transducers;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning for Signal Processing, 2005 IEEE Workshop on

Conference_Location

Mystic, CT

Print_ISBN

0-7803-9517-4

Type

conf

DOI

10.1109/MLSP.2005.1532893

Filename

1532893