DocumentCode :
3713041
Title :
An open/free database and Benchmark for Uyghur speaker recognition
Author :
Askar Rozi; Dong Wang; Zhiyong Zhang;Thomas Fang Zheng
Author_Institution :
Center for Speech and Language Technologies, Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, China
fYear :
2015
Firstpage :
81
Lastpage :
85
Abstract :
Few research has been conducted on Uyghur speaker recognition. Among the limited works, researchers usually collect small speech databases and publish results based on their own private data. This `close-door evaluation´ makes most of the publications doubtable. This paper publishes an open and free speech database THUYG-20 SRE and a benchmark for Uyghur speaker recognition. The database is based on the THUYG-20 speech corpus we recently released, and the benchmark involves recognition tasks with various training/enrollment/test conditions. We provide a complete description for the database as well as the benchmark, and present an i-vector baseline system constructed using the Kaldi toolkit.
Keywords :
"Speech","Databases","Speaker recognition","Signal to noise ratio","Benchmark testing","Speech recognition","Training"
Publisher :
ieee
Conference_Titel :
Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015 International Conference
Type :
conf
DOI :
10.1109/ICSDA.2015.7357869
Filename :
7357869
Link To Document :
بازگشت