• DocumentCode
    695641
  • Title

    A systematic strategy for robust automatic dialect identification

  • Author

    Liu, Gang A. ; Hansen, John H. L.

  • Author_Institution
    CRSS: Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    2138
  • Lastpage
    2141
  • Abstract
    Automatic dialect Classification is very important for speech based human computer interface and customer electronic products. Although many studies have been performed in ideal environment, little work has been done in noisy or small data corpus, both of which are very critical for the survival of a dialect identification system. This paper investigates a series of strategies to address the question of small and noisy dataset dialect classification task. A novel hierarchical universal background model is proposed to address the question of limited training dataset. To address the noisy question, we initiate the use of perceptual minimum variance distortionless response (PMVDR), combining with shifted delta cepstral (SDC) algorithm. Rotation forest is also explored to further improve the system performance. Finally, compared with the baseline system, the proposed best system shows relative gains of 31:8% and 28:7%, in the worse noise and clean condition on a small data set, respectively.
  • Keywords
    cepstral analysis; electronic products; human computer interaction; pattern classification; speech processing; speech recognition; speech-based user interfaces; PMVDR; SDC algorithm; automatic dialect classification; clean condition; customer electronic products; hierarchical universal background model; noisy dataset dialect classification task; perceptual minimum variance distortionless response; robust automatic dialect identification; shifted delta cepstral algorithm; speech based human computer interface; training dataset; worse noise; Feature extraction; Mel frequency cepstral coefficient; Noise; Noise measurement; Robustness; Speech; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074191