• DocumentCode
    595194
  • Title

    Unsupervised Tibetan speech features Learning based on Dynamic Bayesian Networks

  • Author

    Yue Zhao ; Xiaona Xu ; GuoSheng Yang

  • Author_Institution
    Minzu Univ. of China, Minzu, China
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    2319
  • Lastpage
    2322
  • Abstract
    This paper proposed an unsupervised learning method to learn speech features based on Dynamic Bayesian Networks (DBNs) that accounts for the spatiotemporal dependences in speech signal. Although deep networks have been successfully applied to unsupervised learning features, the structures of the deep networks are often fixed before learning and they fail to capture temporal representation. In this paper, we propose to construct DBNs for unsupervised learning spatial-temporal features from speech data. The experiment results on Tibetan speech data showed the features learned using proposed DBNs outperforms the state-of-art methods in word recognition accuracy.
  • Keywords
    belief networks; feature extraction; signal representation; spatiotemporal phenomena; speech processing; speech recognition; unsupervised learning; word processing; DBN; Tibetan speech feature learning; deep network; dynamic Bayesian network; spatiotemporal dependency; speech signal processing; temporal representation; unsupervised learning method; word recognition; Bayesian methods; Hidden Markov models; Network topology; Speech; Speech recognition; Topology; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460629