Title :
Multi-level adaptive network for accented Mandarin speech recognition
Author :
Huiyong Wang ; Lan Wang ; Xunying Liu
Author_Institution :
Shenzhen Inst. of Adv. Technol., Univ. of Chinese Acad. of Sci., Shenzhen, China
Abstract :
Accented speech recognition is more challenging than standard speech recognition because of the acoustic and linguistic mismatch between standard and accented data. In this paper, we propose a new framework that combines a Tandem system, which improves the discriminative ability of the acoustic features, with a Multi-level Adaptive Network (MLAN), which incorporates information from a standard Mandarin corpus and also addresses the data sparseness problem. Mandarin spoken by Guangzhou speakers is treated as accented Mandarin (accented Putonghua, A-PTH), while Mandarin spoken by speakers from the northern area is treated as standard Mandarin (standard Putonghua, S-PTH). Significant relative character error rate reductions of 13.8% and 24.6% are obtained over baseline GMM-HMM systems trained on a mixed corpus containing both A-PTH and S-PTH data and on the A-PTH corpus only, respectively.
Keywords :
Gaussian processes; error statistics; hidden Markov models; natural language processing; speech recognition; A-PTH corpus; Guangzhou speakers; MLAN; Mandarin spoken; S-PTH corpus; Tandem system; accented Mandarin speech recognition; acoustic features; acoustic mismatch; baseline GMM-HMM systems; character error rate reduction; data sparseness problem; discriminative ability; linguistic mismatch; multilevel adaptive network; standard Mandarin corpus; standard Putonghua; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR; Tandem system; accented; mandarin; neural network adaptation;
Conference_Title :
Information Science and Technology (ICIST), 2014 4th IEEE International Conference on
Conference_Location :
Shenzhen
DOI :
10.1109/ICIST.2014.6920550