Title :
Acoustic modeling for native and non-native Mandarin speech recognition
Author :
Xin Chen ; Jian Cheng
Author_Institution :
Knowledge Technol., Menlo Park, CA, USA
Abstract :
In this paper, we first described the automatic Spoken Chinese Test (SCT). With a large amount of native and non-native data collected for SCT, different training strategies for acoustic modeling were investigated. Evaluations were performed on native as well as non-native datasets. We discovered that directly combining native and non-native data to train acoustic models did not work well, and the acoustic model trained only on native data achieved better performance when applying to non-native speech. We investigated how to use non-native data effectively, and found that Phonetic Decision Tree (PDT) had a great impact. Discriminative training was found to improve speech recognition accuracy effectively for both native and non-native Mandarin speech.
Keywords :
acoustic signal processing; decision trees; natural language processing; speech recognition; PDT; SCT; acoustic modeling; automatic spoken Chinese test; discriminative training; nonnative Mandarin speech recognition; nonnative data; phonetic decision tree; speech recognition accuracy; training strategy; Accuracy; Acoustics; Data models; Hidden Markov models; Speech; Speech recognition; Training; Mandarin; acoustic modeling; discriminative training; no-nnative speech recognition; spoken language assessment;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423544