• DocumentCode
    2309974
  • Title

    Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection

  • Author

    Tomalin, M. ; Woodland, P.C.

  • Author_Institution
    Dept. of Eng., Cambridge Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper compares the performance of two types of prosodic feature models (PFMs) in a sentence boundary detection task. Specifically, systems are compared that use discriminatively trained Gaussian mixture models (MMI-GMMs) and CART-style decision trees (CDT-PFMs), along with task-specific language models, in a lattice-based decoding framework in order automatically to insert slash unit (SU) boundaries into automatic speech recognition (ASR) transcriptions of input audio files. It is shown that a system which uses MMI-GMMs performs as well as a system that uses conventional CDT-PFMs. In addition, it is shown that, when the CDT-PFM and MMI-GMM systems are combined by taking weighted averages of their respective probability streams, error rate improvements of up to 0.8% abs over the CDT-PFM baseline can be obtained for four different test sets
  • Keywords
    Gaussian processes; decision trees; decoding; probability; speech recognition; CART-style decision trees; automatic speech recognition transcriptions; discriminatively trained Gaussian mixture models; input audio files; lattice-based decoding framework; prosodic feature models; sentence boundary detection; slash unit boundaries; task-specific language models; Automatic speech recognition; Decision trees; Decoding; Ear; Error analysis; Model driven engineering; Natural languages; Streaming media; System testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660079
  • Filename
    1660079