• DocumentCode
    3530198
  • Title

    Automatic prosodic events detection using syllable-based acoustic and syntactic features

  • Author

    Jeon, Je Hun ; Yang Liu

  • Author_Institution
    Comput. Sci. Depatrment, Univ. of Texas at Dallas, Richardson, TX
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4565
  • Lastpage
    4568
  • Abstract
    Automatic prosodic event detection is important for both speech understanding and natural speech synthesis since prosody provides additional information over the short-term segmental features and lexical representation of an utterance. Similar to previous work, this paper focuses on automatic detection of coarse level representation of pitch accents, intonational phrase boundaries (IPB), and break indices. We exploit various classifiers and identify effective feature sets to improve performance of prosodic event detection according to acoustic, lexical, and syntactic evidence. our experiments on the Boston University Radio News Corpus show that the neural network classifier achieves the best performance for modeling acoustic evidence, and that support vector machines are more effective for the lexical and syntactic evidence. The combination of the acoustic and the syntactic models yields 89.8% accent detection accuracy, 93.3% IPB detection accuracy, and 91.1% break index detection accuracy. Compared with previous work, the IPB performance is similar, whereas the results for accent and break index detection are significantly better.
  • Keywords
    neural nets; pattern classification; speech recognition; speech synthesis; acoustic evidence modeling; automatic prosodic events detection; break indices; coarse level representation; intonational phrase boundaries; natural speech synthesis; neural network classifier; pitch accents; speech understanding; support vector machines; syllable-based acoustic; syllable-based syntactic features; utterance lexical representation; Event detection; Prosodic event detection; ToBI annotation; accent; break index; intonational phrase boundary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960646
  • Filename
    4960646