• DocumentCode
    3486123
  • Title

    Detection-based accented speech recognition using articulatory features

  • Author

    Zhang, Chao ; Liu, Yi ; Lee, Chin-Hui

  • Author_Institution
    Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    500
  • Lastpage
    505
  • Abstract
    We propose an attribute-based approach to accented speech recognition based on automatic speech attribute transcription with high efficiency detection of articulatory features. In order to utilize appropriate and extensible phonetic and linguistic knowledge, conditional random field (CRF) is designed to take frame-level inputs with binary feature functions. The use of CRF with merely the state features to generate probabilistic phone lattices is then utilized to solve the phone under-generation problem. Finally an attribute discrimination module is incorporated to handle a diversity of accent changes without retraining any model, leading to flexible “plug `n´ play” modular design. The effectiveness of the proposed approach is evaluated on three typical Chinese accents, namely Guanhua, Yue and Wu. Our method yields a significant absolute phone recognition accuracy improvement 5.04%, 4.68% and 6.06% for the corresponding three accent types over a conventional monophone HMM system. Compared to a context-dependent triphone HMM system, we achieve comparable phone accuracies at only less than 20% of the computation cost. In addition, our proposed method is equally applicable to speaker-independent systems handling multiple accents.
  • Keywords
    speech; speech recognition; articulatory features; automatic speech attribute transcription; binary feature functions; conditional random field; detection-based accented speech recognition; probabilistic phone lattices; speaker-independent systems; Acoustics; Corporate acquisitions; Detectors; Feature extraction; Hidden Markov models; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163982
  • Filename
    6163982