Title :
Fine-grained pitch accent and boundary tone labeling with parametric F0 features
Author :
Ananthakrishnan, Sankaranarayanan ; Narayanan, Shrikanth
Author_Institution :
Dept. of Electr. Eng., Southern California Univ., Los Angeles, CA
fDate :
March 31 2008-April 4 2008
Abstract :
Motivated by linguistic theories of prosodic categoricity, symbolic representations of prosody have recently attracted the attention of speech technologists. Categorical representations such as ToBI not only bear linguistic relevance, but also have the advantage that they can be easily modeled and integrated within applications. Since manual labeling of these categories is time-consuming and expensive, there has been significant interest in automatic prosody labeling. This paper presents a fine-grained ToBI-style prosody labeling system that makes use of features derived from RFC and TILT parameterization of F0 together with an n-gram prosodic language model for 4-way pitch accent labeling and 2-way boundary tone labeling. For this task, our system achieves pitch accent labeling accuracy of 56.4% and boundary tone labeling accuracy of 67.7% on the Boston University Radio News Corpus.
Keywords :
speech processing; boundary tone labeling; categorical representations; fine-grained ToBI-style prosody labeling system; fine-grained pitch accent; n-gram prosodic language model; pitch accent labeling; prosodic categoricity; symbolic representations; Humans; Labeling; Laboratories; Large-scale systems; Natural languages; Speech analysis; Speech synthesis; Standards development; Testing; Viterbi algorithm; RFC; TILT; ToBI; boundary tone; pitch accent; prosody;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518667