DocumentCode :
940402
Title :
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence
Author :
Ananthakrishnan, Sankaranarayanan ; Narayanan, Shrikanth S.
Author_Institution :
Signal & Image Process. Inst. (SIPI), Univ. of Southern California, Los Angeles, CA
Volume :
16
Issue :
1
fYear :
2008
Firstpage :
216
Lastpage :
228
Abstract :
With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting prosodic events in speech. This is because the prosodic tier provides an additional layer of information over the short-term segment-level features and lexical representation of an utterance. As the prosody of an utterance is closely tied to its syntactic and semantic content in addition to its lexical content, knowledge of the prosodic events within and across utterances can assist spoken language applications such as automatic speech recognition and translation. On the other hand, corpora annotated with prosodic events are useful for building natural-sounding speech synthesizers. In this paper, we build an automatic detector and classifier for prosodic events in American English, based on their acoustic, lexical, and syntactic correlates. Following previous work in this area, we focus on accent (prominence, or “stress”) and prosodic phrase boundary detection at the syllable level. Our experiments achieved a performance rate of 86.75% agreement on the accent detection task, and 91.61% agreement on the phrase boundary detection task on the Boston University Radio News Corpus.
Keywords :
acoustic signal processing; natural languages; signal classification; speech processing; speech recognition; speech synthesis; American English; acoustic evidence; automatic prosodic event classifier; automatic speech prosodic event detection; automatic speech recognition; automatic speech translation; lexical evidence; natural-sounding speech synthesizer; prosodic phrase boundary detection; prosody annotation standard; spoken language processing; syntactic evidence; accent; prominence; prosodic phrase boundary; prosody recognition; prosody-syntax interface; stress
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.907570
Filename :
4358088