• DocumentCode
    312018
  • Title

    Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs

  • Author

    Kurimo, Mikko ; Somervuo, Panu

  • Author_Institution
    Neural Networks Res. Centre, Helsinki Univ. of Technol., Espoo, Finland
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    358
  • Abstract
    This paper presents methods to improve the probability density estimation in hidden Markov models for phoneme recognition by exploiting the self-organizing map (SOM) algorithm. The advantage of using the SOM is based on the created approximative topology between the mixture densities by training the Gaussian mean vectors used as the kernel centers by the SOM algorithm. The topology allows the neighboring mixtures to respond strongly to the same inputs and so most of the nearest mixtures used to approximate the current observation probability will be found in the topological neighborhood of the “winner” mixture. Also the knowledge about the previous winners are used to speed up the search for the new winners. Tree-search SOMs and segmental SOM training are studied aiming at faster search and suitability for HMM training. The framework for the presented experiments includes mel-cepstrum features and phoneme-wise tied mixture density HMMs
  • Keywords
    Gaussian processes; cepstral analysis; hidden Markov models; learning (artificial intelligence); probability; self-organising feature maps; speech recognition; tree searching; Gaussian mean vectors; SOM; mel-cepstrum features; mixture density hidden Markov model; neighboring mixtures; observation probability; phoneme recognition; phoneme-wise tied mixture density HMM; probability density estimation; searching; segmental SOM training; self-organizing map; speech recognition; training; tree search; Cepstrum; Hidden Markov models; Neural networks; Organizing; Samarium; Spatial databases; Speech recognition; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607128
  • Filename
    607128