Regional Information Center for Science and Technology (RICEST) - Shared speech attribute augmentation for English-Tibetan cross-language phone recognition

DocumentCode :

3738572

Title :

Shared speech attribute augmentation for English-Tibetan cross-language phone recognition

Author :

Yue Zhao;Nan Zhou;Libing Zhang;Licheng Wu;Rui Zheng;Xiaoyang Wang;Qiang Ji

Author_Institution :

Department of Automation, Minzu University of China, Beijing

Year :

2015

Firstpage :

539

Lastpage :

543

Abstract :

Finding a universal set of speech attributes shared among a large number of languages is a challenging research topic for detection-based, bottom-up cross-language speech recognition. In some recent work, articulatory features have been used as a universal set of speech attributes shared across many different languages. Because they are defined by humans as a set of semantic articulatory descriptions of phones, these manually specified attributes capture the articulation information of all languages incompletely and are not distinctive enough for accurate phoneme recognition in cross-language transfer. In this paper, we address the problem of obtaining a more complete articulatory feature representation by using a sparse coding method. We learn augmented articulatory attributes that sparsely represent more of the speech articulation information shared between the source and target languages. In our experiments on English-Tibetan cross-language phone recognition, the augmented attributes achieved better accuracy than the semantic attributes.

Keywords :

"Semantics","Speech recognition","Speech","Encoding","Hidden Markov models","Dictionaries","Feature extraction"

Publisher :

IEEE

Conference_Title :

Signal Processing and Information Technology (ISSPIT), 2015 IEEE International Symposium on

Type :

conf

DOI :

10.1109/ISSPIT.2015.7394395

Filename :

7394395

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3738572