• DocumentCode
    3422913
  • Title

    A novel algorithm for unsupervised prosodic language model adaptation

  • Author

    Ananthakrishnan, Sankaranarayanan ; Narayanan, Shrikanth

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4181
  • Lastpage
    4184
  • Abstract
    Symbolic representations of prosodic events have been shown to be useful for spoken language applications such as speech recognition. However, a major drawback with categorical prosody models is their lack of scalability due to the difficulty in annotating large corpora with prosodic tags for training. In this paper, we present a novel, unsupervised adaptation technique for bootstrapping categorical prosodic language models (PLMs) from a small, annotated training set. Our experiments indicate that the adaptation algorithm significantly improves the quality and coverage of the PLM. On a test set derived from the Boston University Radio News corpus, the adapted PLM gave a relative improvement of 13.8% over the seed PLM on the binary pitch accent detection task, while reducing the OOV rate by 16.5% absolute.
  • Keywords
    natural language processing; speech processing; bootstrapping categorical prosodic language models; spoken language applications; symbolic representations; unsupervised prosodic language model adaptation; Adaptation model; Automatic speech recognition; Labeling; Laboratories; Lattices; Natural languages; Speech analysis; Speech recognition; Stress; Testing; lattice posterior; pitch accent; prosodic language model; prosody; unsupervised adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518576
  • Filename
    4518576