DocumentCode
3125500
Title
Acoustic modeling for native and non-native Mandarin speech recognition
Author
Xin Chen ; Jian Cheng
Author_Institution
Knowledge Technol., Menlo Park, CA, USA
fYear
2012
fDate
5-8 Dec. 2012
Firstpage
325
Lastpage
329
Abstract
In this paper, we first described the automatic Spoken Chinese Test (SCT). With a large amount of native and non-native data collected for SCT, different training strategies for acoustic modeling were investigated. Evaluations were performed on native as well as non-native datasets. We discovered that directly combining native and non-native data to train acoustic models did not work well, and the acoustic model trained only on native data achieved better performance when applying to non-native speech. We investigated how to use non-native data effectively, and found that Phonetic Decision Tree (PDT) had a great impact. Discriminative training was found to improve speech recognition accuracy effectively for both native and non-native Mandarin speech.
Keywords
acoustic signal processing; decision trees; natural language processing; speech recognition; PDT; SCT; acoustic modeling; automatic spoken Chinese test; discriminative training; nonnative Mandarin speech recognition; nonnative data; phonetic decision tree; speech recognition accuracy; training strategy; Accuracy; Acoustics; Data models; Hidden Markov models; Speech; Speech recognition; Training; Mandarin; acoustic modeling; discriminative training; no-nnative speech recognition; spoken language assessment;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location
Kowloon
Print_ISBN
978-1-4673-2506-6
Electronic_ISBN
978-1-4673-2505-9
Type
conf
DOI
10.1109/ISCSLP.2012.6423544
Filename
6423544
Link To Document