• DocumentCode
    3125500
  • Title

    Acoustic modeling for native and non-native Mandarin speech recognition

  • Author

    Xin Chen ; Jian Cheng

  • Author_Institution
    Knowledge Technol., Menlo Park, CA, USA
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    325
  • Lastpage
    329
  • Abstract
    In this paper, we first described the automatic Spoken Chinese Test (SCT). With a large amount of native and non-native data collected for SCT, different training strategies for acoustic modeling were investigated. Evaluations were performed on native as well as non-native datasets. We discovered that directly combining native and non-native data to train acoustic models did not work well, and the acoustic model trained only on native data achieved better performance when applying to non-native speech. We investigated how to use non-native data effectively, and found that Phonetic Decision Tree (PDT) had a great impact. Discriminative training was found to improve speech recognition accuracy effectively for both native and non-native Mandarin speech.
  • Keywords
    acoustic signal processing; decision trees; natural language processing; speech recognition; PDT; SCT; acoustic modeling; automatic spoken Chinese test; discriminative training; nonnative Mandarin speech recognition; nonnative data; phonetic decision tree; speech recognition accuracy; training strategy; Accuracy; Acoustics; Data models; Hidden Markov models; Speech; Speech recognition; Training; Mandarin; acoustic modeling; discriminative training; no-nnative speech recognition; spoken language assessment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423544
  • Filename
    6423544