• DocumentCode
    244901
  • Title

    Dual-Domain Hierarchical Classification of Phonetic Time Series

  • Author

    Hamooni, Hossein ; Mueen, Abdullah

  • Author_Institution
    Dept. of Comput. Sci., Univ. of New Mexico, Albuquerque, NM, USA
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    160
  • Lastpage
    169
  • Abstract
    Phonemes are the smallest units of sound produced by a human being. Automatic classification of phonemes is a well-researched topic in linguistics due to its potential for robust speech recognition. With the recent advancement of phonetic segmentation algorithms, it is now possible to generate datasets of millions of phonemes automatically. Phoneme classification on such datasets is a challenging data mining task because of the large number of classes (over a hundred) and complexities of the existing methods. In this paper, we introduce the phoneme classification problem as a data mining task. We propose a dual-domain (time and frequency) hierarchical classification algorithm. Our method uses a Dynamic Time Warping (DTW) based classifier in the top layers and time-frequency features in the lower layer. We cross-validate our method on phonemes from three online dictionaries and achieved up to 35% improvement in classification compared to existing techniques. We provide case studies on classifying accented phonemes and speaker invariant phoneme classification.
  • Keywords
    data mining; linguistics; signal classification; speech recognition; time series; DTW based classifier; automatic classification; data mining; datasets; dual-domain hierarchical classification; dynamic time warping; frequency hierarchical classification algorithm; linguistics; online dictionaries; phonetic segmentation algorithms; phonetic time series; robust speech recognition; sound units; speaker invariant phoneme classification; time hierarchical classification algorithm; time-frequency features; Accuracy; Dictionaries; Robustness; Speech; Speech recognition; Standards; Time series analysis; Big data; Phoneme classification; time series mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.92
  • Filename
    7023333