• DocumentCode
    706302
  • Title

    A method to integrate additional knowledge sources into HMM based on junction tree decomposition

  • Author

    Sakti, Sakriani ; Markov, Konstantin ; Nakamura, Satoshi

  • Author_Institution
    Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
  • fYear
    2007
  • fDate
    3-7 Sept. 2007
  • Firstpage
    2404
  • Lastpage
    2408
  • Abstract
    Most current automatic speech recognition (ASR) systems use statistical data-driven methods based on hidden Markov models (HMMs). Although such approaches have proved to be efficient choices, ASR systems often still perform much worse than human listeners, especially in the presence of unexpected acoustic variability. Only a limited level of success can be achieved, by relying only on statistical models and mostly ignoring the additional knowledge available. We propose a new method of integrating various kinds of additional knowledge sources into an HMM-based statistical acoustic model in this paper. We utilized the junction tree algorithm to achieve efficient integration due to increased model complexity. This is since it facilitates the decomposition of the joint probability density function (PDF) into a linked set of local conditional PDFs. This way, a simplified form of the model could be constructed and reliably estimated using limited training data. We evaluated how efficient the proposed method was on an LVCSR task using two different types of accented English speech data. The experimental results revealed that our method improved word accuracy with respect to the standard HMM.
  • Keywords
    hidden Markov models; probability; speech recognition; statistical analysis; trees (mathematics); English speech data; HMM; LVCSR task; PDF; additional knowledge source; automatic speech recognition system; hidden Markov models; junction tree algorithm; junction tree decomposition; limited training data; probability density function; statistical acoustic model; statistical data-driven method; Acoustics; Context; Data models; Hidden Markov models; Junctions; Speech; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2007 15th European
  • Conference_Location
    Poznan
  • Print_ISBN
    978-839-2134-04-6
  • Type

    conf

  • Filename
    7099239