• DocumentCode
    337438
  • Title

    Irrelevant variability normalization in learning HMM state tying from data based on phonetic decision-tree

  • Author

    Huo, Qiang ; Ma, Bin

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., Hong Kong
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    577
  • Abstract
    We propose to apply the concept of irrelevant variability normalization to the general problem of learning structure from data. Because of the problems of a diversified training data set and/or possible acoustic mismatches between training and testing conditions, the structure learned from the training data by using a maximum likelihood training method will not necessarily generalize well on mismatched tasks. We apply the above concept to the structural learning problem of phonetic decision-tree based hidden Markov model (HMM) state tying. We present a new method that integrates a linear-transformation based normalization mechanism into the decision-tree construction process to make the learned structure have a better modeling capability and generalizability. The viability and efficacy of the proposed method are confirmed in a series of experiments for continuous speech recognition of Mandarin Chinese
  • Keywords
    decision trees; hidden Markov models; learning systems; maximum likelihood estimation; natural languages; speech recognition; HMM state tying; Mandarin Chinese; acoustic mismatches; continuous speech recognition; diversified training data set; experiments; hidden Markov model; irrelevant variability normalization; learned structure; linear-transformation based normalization; maximum likelihood training method; phonetic decision-tree; structural learning problem; structure learning; testing condition; training condition; training data; Acoustic testing; Automatic speech recognition; Buildings; Computer science; Hidden Markov models; Information systems; Maximum likelihood estimation; Robustness; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759732
  • Filename
    759732