DocumentCode
114256
Title
Multi-level adaptive network for accented Mandarin speech recognition
Author
Huiyong Wang ; Lan Wang ; Xunying Liu
Author_Institution
Shenzhen Inst. of Adv. Technol., Univ. of Chinese Acad. of Sci., Shenzhen, China
fYear
2014
fDate
26-28 April 2014
Firstpage
602
Lastpage
605
Abstract
Accented speech recognition is more challenging than standard speech recognition due to acoustic and linguistic mismatch between standard and accented data. In this paper, we propose a new framework combining Tandem system to improve the discriminative ability of acoustic features with Multi-level Adaptive Network (MLAN) to incorporate information from standard Mandarin corpus and also to solve the data sparseness problem. Mandarin spoken by Guangzhou speakers is considered as the accented mandarin (accented Putonghua, A-PTH), while spoken by northern area as the standard mandarin (standard Putonghua, S-PTH). Significant character error rate reduction of 13.8% and 24.6% relative are obtained over the baseline GMM-HMM systems trained on mixed corpus including both A-PTH and S-PTH corpus, as well as only the A-PTH corpus respectively.
Keywords
Gaussian processes; error statistics; hidden Markov models; natural language processing; speech recognition; A-PTH corpus; Guangzhou speakers; MLAN; Mandarin spoken; S-PTH corpus; Tandem system; accented Mandarin speech recognition; acoustic features; acoustic mismatch; baseline GMM-HMM systems; character error rate reduction; data sparseness problem; discriminative ability; linguistic mismatch; multilevel adaptive network; standard Mandarin corpus; standard Putonghua; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR; Tandem system; accented; mandarin; neural network adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Technology (ICIST), 2014 4th IEEE International Conference on
Conference_Location
Shenzhen
Type
conf
DOI
10.1109/ICIST.2014.6920550
Filename
6920550
Link To Document