DocumentCode :
134289
Title :
Cross-language speech attribute detection and phone recognition for Tibetan using deep learning
Author :
Hui Wang ; Yue Zhao ; Yanmin Xu ; Xiaona Xu ; Xingmei Suo ; Qiang Ji
Author_Institution :
Dept. of Autom., Minzu Univ. of China, Beijing, China
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
474
Lastpage :
477
Abstract :
Articulatory features (AFs) are viewed as the universal speech attributes for cross-language speech recognition. They are usually detected using a bank of multi-layer perceptrons (MLPs) in a supervised manner. In this paper, we propose to apply the deep learning method to detect AF-based speech attributes in a semi-supervised manner for cross-language speech recognition. The experimental results on Tibetan phone recognition showed that the deep learning method can detect the AF-based speech attributes more accurately and has higher phone recognition rates than MLPs.
Keywords :
learning (artificial intelligence); multilayer perceptrons; speech recognition; AF-based speech attributes; MLP; Tibetan phone recognition; articulatory features; cross-language speech attribute detection; cross-language speech recognition; deep learning method; multilayer perceptrons; phone recognition rates; supervised manner; universal speech attributes; Accuracy; Acoustics; Data models; Detectors; Learning systems; Speech; Speech recognition; Cross-language speech recognition; Tibetan language; articulatory features; deep learning; sparse auto-encoder;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936682
Filename :
6936682
Link To Document :
بازگشت