Title :
Applications of Dirichlet Process Mixtures to speaker adaptation
Author :
Torbati, Amir Hossein Harati Nejad ; Picone, Joe ; Sobel, Marc
Author_Institution :
Dept. of Electr. & Comp. Eng., Temple Univ., Philadelphia, PA, USA
Abstract :
Balancing unique acoustic characteristics of a speaker such as identity and accent, with general acoustic behavior that describes phoneme identity, is one of the great challenges in applying nonparametric Bayesian approaches to speaker adaptation. The Dirichlet Process Mixture (DPM) is a relatively new model that provides an elegant framework in which individual characteristics can be balanced with aggregate behavior without diluting the quality of the individual models. Unlike Gaussian Mixture models (GMMs), which tend to smear multimodal behavior through averaging, the DPM model attempts to preserve unique behaviors through use of an infinite mixture model. In this paper, we present some exploratory research on applying these models to the acoustic modeling component of the speaker adaptation problem. DPM based models are shown to provide up to 10% reduction in WER over maximum likelihood linear regression (MLLR) on a speaker adaptation task based on the Resource Management database.
Keywords :
Bayes methods; maximum likelihood estimation; regression analysis; speaker recognition; Dirichlet process mixtures; aggregate behavior; elegant framework; general acoustic behavior; infinite mixture; maximum likelihood linear regression; nonparametric Bayesian approach; phoneme identity; resource management database; smear multimodal behavior; speaker adaptation; unique acoustic characteristics; Adaptation models; Clustering algorithms; Computational modeling; Hidden Markov models; Inference algorithms; Mathematical model; Regression tree analysis; Dirichlet Process Mixture; nonparametric Bayesian models; speaker adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288875