Title :
A systematic strategy for robust automatic dialect identification
Author :
Liu, Gang A. ; Hansen, John H. L.
Author_Institution :
CRSS: Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
Automatic dialect Classification is very important for speech based human computer interface and customer electronic products. Although many studies have been performed in ideal environment, little work has been done in noisy or small data corpus, both of which are very critical for the survival of a dialect identification system. This paper investigates a series of strategies to address the question of small and noisy dataset dialect classification task. A novel hierarchical universal background model is proposed to address the question of limited training dataset. To address the noisy question, we initiate the use of perceptual minimum variance distortionless response (PMVDR), combining with shifted delta cepstral (SDC) algorithm. Rotation forest is also explored to further improve the system performance. Finally, compared with the baseline system, the proposed best system shows relative gains of 31:8% and 28:7%, in the worse noise and clean condition on a small data set, respectively.
Keywords :
cepstral analysis; electronic products; human computer interaction; pattern classification; speech processing; speech recognition; speech-based user interfaces; PMVDR; SDC algorithm; automatic dialect classification; clean condition; customer electronic products; hierarchical universal background model; noisy dataset dialect classification task; perceptual minimum variance distortionless response; robust automatic dialect identification; shifted delta cepstral algorithm; speech based human computer interface; training dataset; worse noise; Feature extraction; Mel frequency cepstral coefficient; Noise; Noise measurement; Robustness; Speech; Training;
Conference_Titel :
Signal Processing Conference, 2011 19th European
Conference_Location :
Barcelona