• DocumentCode
    312305
  • Title

    Syntactic-prosodic labeling of large spontaneous speech data-bases

  • Author

    Batliner, A. ; Kompe, R. ; Kiessling, A. ; Niemann, H. ; Nöth, E.

  • Author_Institution
    Inst. für Deutsche Philologie, Ludwig-Maximilians-Univ., München, Germany
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1720
  • Abstract
    In automatic speech understanding, dividing continuously running speech into syntactic chunks is a major problem. Syntactic boundaries are often marked by prosodic means. Training statistical models for prosodic boundaries requires large databases. For the German VERBMOBIL project (automatic speech-to-speech translation), we developed a syntactic-prosodic labeling scheme in which two main types of boundaries (major syntactic boundaries and syntactically ambiguous boundaries) and some other special boundaries are labeled for a large VERBMOBIL spontaneous speech corpus. We compare the results of classifiers (multilayer perceptrons and language models) trained on these syntactic-prosodic boundary labels with classifiers trained on perceptual-prosodic and pure syntactic labels. The main advantage of the rough syntactic-prosodic labels presented in this paper is that large amounts of data could be labeled within a short time. As a result, the classifiers trained with these labels turned out to be superior (recognition rates of up to 96%).
  • Keywords
    feedforward neural nets; linguistics; multilayer perceptrons; natural language interfaces; pattern classification; speech processing; statistical analysis; very large databases; VERBMOBIL project; automatic speech understanding; classifiers; continuously running speech; language models; large spontaneous speech databases; multilayer perceptrons; recognition rates; speech-to-speech translation; statistic models; syntactic boundaries; syntactic labels; syntactic-prosodic labeling; training; Bars; Educational technology; History; Labeling; Mirrors; Natural languages; Robustness; Speech; Statistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607959
  • Filename
    607959