• DocumentCode
    1576011
  • Title

    Unsupervised discovery of phoneme boundaries in multi-speaker continuous speech

  • Author

    Armstrong, Tom ; Antetomaso, Stephanie

  • Author_Institution
    Wheaton Coll., Norton, MA, USA
  • Volume
    2
  • fYear
    2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Children rapidly learn the inventory of phonemes used in their native tongues. Computational approaches to learning phoneme boundaries from speech data do not yet reach the level of human performance. We present an algorithm that operates on, qualitatively, similar data to those children receive: natural language utterances from multiple speakers. Our algorithm is unsupervised and discovers phoneme boundary positions in speech. The approach draws inspiration from the word and text segmentation literature. To demonstrate the efficacy of our algorithm on speech data, we present empirical results of our method using the TIMIT data set. Our method achieves F-measure scores in the 0.68 - 0.73 range for locating phoneme boundary positions.
  • Keywords
    natural language processing; speech processing; human performance; multiple speakers; multispeaker continuous speech; natural language utterances; phoneme boundaries; speech data; text segmentation; unsupervised discovery; word segmentation; Entropy; Feature extraction; Gold; Manuals; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning (ICDL), 2011 IEEE International Conference on
  • Conference_Location
    Frankfurt am Main
  • ISSN
    2161-9476
  • Print_ISBN
    978-1-61284-989-8
  • Type

    conf

  • DOI
    10.1109/DEVLRN.2011.6037316
  • Filename
    6037316