DocumentCode
2576751
Title
Segmentation of speech signal into phonemes using two-level GMM tokenization
Author
Monica, T. ; Nagarajan, T.
Author_Institution
Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
fYear
2011
fDate
3-5 June 2011
Firstpage
843
Lastpage
847
Abstract
This paper proposes an algorithm for identifying the phoneme boundaries in a given speech signal without the need for its orthographic transcription. The algorithm is a two level process whereby in the first level the phoneme boundaries are determined by silence/voiced/unvoiced classification and in the second level the voiced parts are alone tokenized further. TIMIT database was used to carry out the experiments and to check the correctness of the automatically detected phoneme boundaries. The experimental results showed that the performance of the algorithm in identifying the correct boundaries was ~75%.
Keywords
Gaussian processes; signal classification; speech processing; Gaussian mixture model; TIMIT database; phoneme boundaries; silence classification; speech signal segmentation; two-level GMM tokenization; unvoiced classification; voiced classification; Correlation; Feature extraction; Indexes; Smoothing methods; Speech; Speech recognition; Training; GMM; phoneme; segmentation; speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location
Chennai, Tamil Nadu
Print_ISBN
978-1-4577-0588-5
Type
conf
DOI
10.1109/ICRTIT.2011.5972311
Filename
5972311
Link To Document