DocumentCode
1558344
Title
A Mixture Model Approach for Formant Tracking and the Robustness of Student´s-t Distribution
Author
Sundar, Harshavardhan ; Seelamantula, Chandra Sekhar ; Sreenivas, Thippur V.
Author_Institution
Department of Electrical Communication Engineering, Indian Institute of Science, Bangalore, India
Volume
20
Issue
10
fYear
2012
Firstpage
2626
Lastpage
2636
Abstract
We address the problem of robust formant tracking in continuous speech in the presence of additive noise. We propose a new approach based on mixture modeling of the formant contours. Our approach consists of two main steps: (i) Computation of a pyknogram based on multiband amplitude-modulation/frequency-modulation (AM/FM) decomposition of the input speech; and (ii) Statistical modeling of the pyknogram using mixture models. We experiment with both Gaussian mixture model (GMM) and Student´s-t mixture model (tMM) and show that the latter is robust with respect to handling outliers in the pyknogram data, parameter selection, accuracy, and smoothness of the estimated formant contours. Experimental results on simulated data as well as noisy speech data show that the proposed tMM-based approach is also robust to additive noise. We present performance comparisons with a recently developed adaptive filterbank technique proposed in the literature and the classical Burg´s spectral estimator technique, which show that the proposed technique is more robust to noise.
Keywords
Amplitude modulation; Computational modeling; Frequency modulation; Gaussian mixture model; Hidden Markov models; Robustness; Speech; Formant tracking; Gaussian mixture model (GMM); Student´s-t mixture model (tMM); multimodal density estimation; statistical mixture modeling;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2012.2209418
Filename
6243191
Link To Document