DocumentCode :
2406989
Title :
CECOS: A Chinese-English code-switching speech database
Author :
Shen, Han-Ping ; Wu, Chung-Hsien ; Yang, Yan-Ting ; Hsu, Chun-Shan
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2011
fDate :
26-28 Oct. 2011
Firstpage :
120
Lastpage :
123
Abstract :
With the increase on the demands for code-switching automatic speech recognition (ASR), the design and development of a code-switching speech database becomes highly desirable. However, it is not easy to collect sufficient code-switched utterances for model training for code-switching ASR. This study presents the procedure and experience for the design and development of a Chinese-English COde-switching Speech database (CECOS). Two different methods for collecting Chinese-English code-switched utterances are employed in this work. The applications of the collected database are also introduced. The CECOS database not only contains the speech data with code-switch properties but also accents due to non-native speakers. This database can be applied to several applications, such as code-switching speech recognition, language identification, named entity detection, etc.
Keywords :
audio databases; natural languages; speech recognition; CECOS; Chinese-English code-switching speech database; code-switching automatic speech recognition; language identification; named entity detection; nonnative speakers; Databases; Speech; Speech coding; Speech processing; Speech recognition; Switches; Vocabulary; automatic speech recognition; code-switching; multilingual speech database;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
Type :
conf
DOI :
10.1109/ICSDA.2011.6085992
Filename :
6085992
Link To Document :
بازگشت