• DocumentCode
    3529743
  • Title

    Improved clustered hierarchical tandem system with bottom-up processing

  • Author

    Chang, Shuo-Yiin ; Lee, Lin-shan

  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4441
  • Lastpage
    4444
  • Abstract
    The outputs of multi-layer perceptron (MLP) classifiers have been successfully used in tandem systems as features for HMM-based automatic speech recognition. In a previous paper, we proposed data-driven clustered hierarchical MLP (CHMLP) tandem system yielding improved performance by dividing the complicated global phone classification problem into simpler hierarchical tasks, in which specialized MLPs are trained to classify small clusters of confusing phones in a hierarchical structure. In this paper a bottom-up processing is further proposed to enhance the classification in the above CHMLP and offer even better performance. MLP rescoring for the tandem system is also investigated. The best result achieved 19.1% relative error reduction over the MFCC baseline.
  • Keywords
    hidden Markov models; multilayer perceptrons; speech recognition; HMM-based automatic speech recognition; bottom-up processing; data-driven clustered hierarchical MLP tandem system; global phone classification problem; improved clustered hierarchical tandem system; multi-layer perceptron classifiers; Artificial neural networks; Automatic speech recognition; Clustering algorithms; Feature extraction; Hidden Markov models; Lattices; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; LVCSR; Neural Network; Tandem system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960615
  • Filename
    4960615