• DocumentCode
    312205
  • Title

    Explicit segmentation of speech using Gaussian models

  • Author

    Bonafonte, Antonio ; Nogueiras, Albino ; Rodriguez-Garrido, Antonio

  • Author_Institution
    Univ. Politecnica de Catalunya, Barcelona, Spain
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1269
  • Abstract
    The authors investigate an automatic method to segment labeled speech. The method needs an initial estimation of the segmentation which is provided by an alignment based on HMM. Afterwards, the boundaries are refined moving the frontier frames to the segment which is more similar to the speech frame. Gaussian PDFs are used as a similarity measure. The performance of the method is evaluated using the TIMIT database. If boundary deviations (from the reference position) larger than 20 ms are counted as errors, then the replacement of the boundaries reduces the error by 30%. Additional experiments show how the proposed method makes the performance independent of the speaker dependent or speaker independent data used to estimate the HMM
  • Keywords
    Gaussian distribution; hidden Markov models; speech processing; Gaussian PDF; Gaussian models; TIMIT database; alignment; automatic method; explicit speech segmentation; frontier frames; hidden Markov models; initial segmentation estimation; labeled speech segmentation; performance evaluation; similarity measure; speech frame; Acoustic measurements; Databases; Decoding; Error analysis; Hidden Markov models; Labeling; Loudspeakers; Speech processing; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607841
  • Filename
    607841