Title :
Speech timing and cross-linguistic studies towards computational human modeling
Author :
Sagisaka, Yoshinori ; Kato, Hiroaki ; Tsuzaki, Minoru ; Nakamura, Shizuka ; Hansakunbuntheung, Chatchawam
Author_Institution :
Language & Speech Sci. Res. Labs., Waseda Univ., Tokyo, Japan
Abstract :
In this paper, we introduce Japanese segmental duration characteristics and computational modeling that we have been studying for around three decades in speech synthesis. A series of experimental results are also shown on loudness dependence in the duration perception. These computational duration modeling and perceptual studies on duration error sensitivity to loudness give some insights for computational human modeling of spoken language capability. As a first trial to figure out how these findings could be efficiently employed in other field like language learning, we introduce our current efforts on the objective evaluation of 2nd language speaking skill and the research consortium of AESOP (Asian English Speech cOrpus Project) where researchers in Asian countries have started to work together.
Keywords :
error statistics; linguistics; speech synthesis; computational human modeling; speech synthesis; speech timing; spoken language capability; Cities and towns; Computational modeling; Humans; Information science; Laboratories; Natural languages; Rhythm; Speech analysis; Speech synthesis; Timing;
Conference_Titel :
Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
Conference_Location :
Urumqi
Print_ISBN :
978-1-4244-4400-7
Electronic_ISBN :
978-1-4244-4400-7
DOI :
10.1109/ICSDA.2009.5278386