DocumentCode
3317083
Title
Data collection for investigating speech variability in a specific speaker over long and short time periods
Author
Tsuge, Satoru ; Shishibori, Masami ; Ren, Fuji ; Kita, Kenji ; Kuroiwa, Shingo
Author_Institution
Fac. of Eng., Tokushima Univ., Japan
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
152
Lastpage
157
Abstract
In this paper, we describe a Japanese speech corpus collected to investigate the speech variability of a specific speaker over short and long time periods. Even when a speaker uses a speaker-dependent speech recognition system, recognition performance is known to vary depending on when the utterance was spoken. This is because speech varies even when the same speaker utters the same sentence. However, the relationship between intra-speaker speech variability and speech recognition performance is not clear, and we are not aware of any corpus of Japanese speech data recorded from a specific speaker over a long time period. Hence, since 2002, we have been collecting speech data to investigate the relationship between speech variability and speech recognition performance. In this paper, we introduce our speech corpus and conduct speech recognition experiments. The experimental results show that recognition performance varies more across different days than within a single day.
Keywords
natural languages; speech processing; speech recognition; Japanese speech corpus; speaker-dependent speech recognition system; speech variability; Automatic speech recognition; Background noise; Cellular phones; Data engineering; Databases; Degradation; Information technology; Navigation; Speech processing; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598725
Filename
1598725