Title :
Criteria for database and tool design for speech timing analysis with special reference to mandarin
Author :
Yu Jue ; Gibbon, D.
Author_Institution :
Sch. of Humanities, Zhejiang Univ., Hangzhou, China
Abstract :
This position paper investigates some of the problems in modelling speech timing for the design of speech databases and corpus analysis tools for phonetics and speech technology. First we examine a selection of phonetic approaches to speech timing analysis, the so-called `rhythm metrics´, and focus on explaining (1) inconsistencies (varying results for the same language) and (2) the failure to model rhythmic alternation. To overcome these problems we present a new perspective on the phonetic identification of rhythm patterns as a special case of duration modelling, including the additional criterion of alternation. We describe the Rhythm Parser, a tool for identifying hierarchical alternating patterns, and discuss results from applying it.
Keywords :
grammars; natural language processing; speech processing; text analysis; Mandarin text; corpus analysis tool design; duration modelling; focus condition; hierarchical rhythm alternation pattern phonetic identification; phonetic approach selection; rhythm metrics; rhythm parser tool; rhythmic alternation criterion; speech database tool design; speech timing analysis; Acceleration; Indexes; Rhythm; Speech; Stress; Timing; bottom-up analysis; peak unit; rhythm metric; speech corpus; speech timing; timing hierarchy;
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
DOI :
10.1109/ICSDA.2012.6422453