DocumentCode
134289
Title
Cross-language speech attribute detection and phone recognition for Tibetan using deep learning
Author
Hui Wang ; Yue Zhao ; Yanmin Xu ; Xiaona Xu ; Xingmei Suo ; Qiang Ji
Author_Institution
Dept. of Autom., Minzu Univ. of China, Beijing, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
474
Lastpage
477
Abstract
Articulatory features (AFs) are viewed as the universal speech attributes for cross-language speech recognition. They are usually detected using a bank of multi-layer perceptrons (MLPs) in a supervised manner. In this paper, we propose to apply the deep learning method to detect AF-based speech attributes in a semi-supervised manner for cross-language speech recognition. The experimental results on Tibetan phone recognition showed that the deep learning method can detect the AF-based speech attributes more accurately and has higher phone recognition rates than MLPs.
Keywords
learning (artificial intelligence); multilayer perceptrons; speech recognition; AF-based speech attributes; MLP; Tibetan phone recognition; articulatory features; cross-language speech attribute detection; cross-language speech recognition; deep learning method; multilayer perceptrons; phone recognition rates; supervised manner; universal speech attributes; Accuracy; Acoustics; Data models; Detectors; Learning systems; Speech; Speech recognition; Cross-language speech recognition; Tibetan language; articulatory features; deep learning; sparse auto-encoder;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936682
Filename
6936682
Link To Document