DocumentCode
1576011
Title
Unsupervised discovery of phoneme boundaries in multi-speaker continuous speech
Author
Armstrong, Tom ; Antetomaso, Stephanie
Author_Institution
Wheaton Coll., Norton, MA, USA
Volume
2
fYear
2011
Firstpage
1
Lastpage
5
Abstract
Children rapidly learn the inventory of phonemes used in their native tongues. Computational approaches to learning phoneme boundaries from speech data do not yet reach the level of human performance. We present an algorithm that operates on, qualitatively, similar data to those children receive: natural language utterances from multiple speakers. Our algorithm is unsupervised and discovers phoneme boundary positions in speech. The approach draws inspiration from the word and text segmentation literature. To demonstrate the efficacy of our algorithm on speech data, we present empirical results of our method using the TIMIT data set. Our method achieves F-measure scores in the 0.68 - 0.73 range for locating phoneme boundary positions.
Keywords
natural language processing; speech processing; human performance; multiple speakers; multispeaker continuous speech; natural language utterances; phoneme boundaries; speech data; text segmentation; unsupervised discovery; word segmentation; Entropy; Feature extraction; Gold; Manuals; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Development and Learning (ICDL), 2011 IEEE International Conference on
Conference_Location
Frankfurt am Main
ISSN
2161-9476
Print_ISBN
978-1-61284-989-8
Type
conf
DOI
10.1109/DEVLRN.2011.6037316
Filename
6037316
Link To Document