DocumentCode :
3660020
Title :
Investigations of low resource multi-accent mandarin speech recognition
Author :
Wei Wang;Wenying Xu;Xiang Sui;Lan Wang;Xunying Liu
Author_Institution :
Key Laboratory of Human Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
fYear :
2015
Firstpage :
62
Lastpage :
66
Abstract :
The Mandarin speech always involves a rich set of regional accents, so that modeling the acoustic variabilities imposed by accents is a challenging task for Mandarin speech recognition. This work investigated using limited accented data to design a multi-accent decision tree, so as to improve the recognition accuracy of traditional GMM-HMM systems. Moreover, the deep neural networks with senone/monophone outputs were used with the multi-accent decision tree based GMM-HMM, in order to build up a robust tandem system. The experiments were evaluated on the database consisting of accented speech collected from seven typical accent regions. The systems designed with the proposed method significantly outperformed conventional GMM-HMMs systems by 1.66% absolute (8.1% relative). The tandem systems trained with DNN and multi-accent decision tree can further reduce the character error rate by 4.64% absolute (24.8% relative), compared to the accent-dependent GMM-HMM system.
Keywords :
"Speech recognition","Acoustics","Hidden Markov models","Decision trees","Speech","Training","Neural networks"
Publisher :
ieee
Conference_Titel :
Information and Automation, 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICInfA.2015.7279259
Filename :
7279259
Link To Document :
بازگشت