DocumentCode :
3738572
Title :
Shared speech attribute augmentation for English-Tibetan cross-language phone recognition
Author :
Yue Zhao;Nan Zhou;Libing Zhang;Licheng Wu;Rui Zheng;Xiaoyang Wang;Qiang Ji
Author_Institution :
Department of Automation, Minzu University of China, Beijing
fYear :
2015
Firstpage :
539
Lastpage :
543
Abstract :
There has been a challenging research topic on exploring an universal set of speech attributes sharing among a large number of languages for detection-based bottom-up cross-language speech recognition. In some recent research works, articulatory features are used as an universal set of speech attributes shared across many different languages. Since they are defined by human as a set of semantic articulatory descriptions of phones, these manually specified attributes suffer from the incomplete capturing articulation information of all languages and are not distinctive enough for accurate phoneme recognition for cross-language transfer. In this paper, we are solving the problem of a more complete set of articulatory features representation by sparse coding method. We learned the augmented articulatory attributes which sparsely represent more speech articulation information sharing between source and target language. The augmented attributes performed the better accuracy over semantic attributes in our experiments for English-Tibetan cross-language phone recognition.
Keywords :
"Semantics","Speech recognition","Speech","Encoding","Hidden Markov models","Dictionaries","Feature extraction"
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2015 IEEE International Symposium on
Type :
conf
DOI :
10.1109/ISSPIT.2015.7394395
Filename :
7394395
Link To Document :
بازگشت