DocumentCode
3660020
Title
Investigations of low resource multi-accent mandarin speech recognition
Author
Wei Wang;Wenying Xu;Xiang Sui;Lan Wang;Xunying Liu
Author_Institution
Key Laboratory of Human Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
fYear
2015
Firstpage
62
Lastpage
66
Abstract
The Mandarin speech always involves a rich set of regional accents, so that modeling the acoustic variabilities imposed by accents is a challenging task for Mandarin speech recognition. This work investigated using limited accented data to design a multi-accent decision tree, so as to improve the recognition accuracy of traditional GMM-HMM systems. Moreover, the deep neural networks with senone/monophone outputs were used with the multi-accent decision tree based GMM-HMM, in order to build up a robust tandem system. The experiments were evaluated on the database consisting of accented speech collected from seven typical accent regions. The systems designed with the proposed method significantly outperformed conventional GMM-HMMs systems by 1.66% absolute (8.1% relative). The tandem systems trained with DNN and multi-accent decision tree can further reduce the character error rate by 4.64% absolute (24.8% relative), compared to the accent-dependent GMM-HMM system.
Keywords
"Speech recognition","Acoustics","Hidden Markov models","Decision trees","Speech","Training","Neural networks"
Publisher
ieee
Conference_Titel
Information and Automation, 2015 IEEE International Conference on
Type
conf
DOI
10.1109/ICInfA.2015.7279259
Filename
7279259
Link To Document