• DocumentCode
    2576751
  • Title

    Segmentation of speech signal into phonemes using two-level GMM tokenization

  • Author

    Monica, T. ; Nagarajan, T.

  • Author_Institution
    Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
  • fYear
    2011
  • fDate
    3-5 June 2011
  • Firstpage
    843
  • Lastpage
    847
  • Abstract
    This paper proposes an algorithm for identifying the phoneme boundaries in a given speech signal without the need for its orthographic transcription. The algorithm is a two level process whereby in the first level the phoneme boundaries are determined by silence/voiced/unvoiced classification and in the second level the voiced parts are alone tokenized further. TIMIT database was used to carry out the experiments and to check the correctness of the automatically detected phoneme boundaries. The experimental results showed that the performance of the algorithm in identifying the correct boundaries was ~75%.
  • Keywords
    Gaussian processes; signal classification; speech processing; Gaussian mixture model; TIMIT database; phoneme boundaries; silence classification; speech signal segmentation; two-level GMM tokenization; unvoiced classification; voiced classification; Correlation; Feature extraction; Indexes; Smoothing methods; Speech; Speech recognition; Training; GMM; phoneme; segmentation; speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
  • Conference_Location
    Chennai, Tamil Nadu
  • Print_ISBN
    978-1-4577-0588-5
  • Type

    conf

  • DOI
    10.1109/ICRTIT.2011.5972311
  • Filename
    5972311