• DocumentCode
    2414529
  • Title

    Acoustic Modelling Using Continuous Rational Kernels

  • Author

    Layton, M.I. ; Gales, M.J.F.

  • Author_Institution
    Dept. of Eng., Cambridge Univ.
  • fYear
    2005
  • fDate
    28-28 Sept. 2005
  • Firstpage
    165
  • Lastpage
    170
  • Abstract
    There has been significant interest in developing alternatives to hidden Markov models (HMMs) for speech recognition. In particular, interest has been focused upon models that allow additional dependencies to be incorporated. One such model is the augmented statistical model. Here a local exponential approximation, based upon derivatives of a base distribution, is made about some distribution of the base model. Augmented statistical models can be trained using a maximum margin criterion, which may be implemented using an SVM with a generative kernel. Calculating derivatives of the base distribution, in particular higher-order derivatives, to form the generative kernel requires complex dynamic programming algorithms. In this paper a new form of rational kernel, a continuous rational kernel is proposed. This allows elements of the generative kernel, including those based on higher-order derivatives, to be computed using standard forms of transducer within a rational kernel framework. In addition, the derivatives are shown to be a principled method of defining marginalised kernels. Continuous rational kernels are evaluated using a large vocabulary continuous speech recognition (LVCSR) task
  • Keywords
    acoustic signal detection; dynamic programming; hidden Markov models; rational functions; speech recognition; statistical analysis; support vector machines; acoustic modelling; augmented statistical model; continuous rational kernels; dynamic programming; generative kernel; hidden Markov models; higher-order derivative; large vocabulary continuous speech recognition; local exponential approximation; support vector machines; Biological system modeling; Dynamic programming; Heuristic algorithms; Hidden Markov models; Kernel; Maximum likelihood estimation; Sequences; Speech recognition; Support vector machines; Transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2005 IEEE Workshop on
  • Conference_Location
    Mystic, CT
  • Print_ISBN
    0-7803-9517-4
  • Type

    conf

  • DOI
    10.1109/MLSP.2005.1532893
  • Filename
    1532893