DocumentCode :
2792156
Title :
Broad phoneme class based speech enhancement using mixture maximum model
Author :
Das, Amit ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4762
Lastpage :
4765
Abstract :
This study develops a speech enhancement technique that uses a series of prior enhanced speech utterances, each optimized for a specific broad phoneme class, to generate a single, composite utterance to improve objective quality scores over all phoneme classes. The noisy utterance is partitioned into phoneme class segments using probabilistic decisions made from the mixture maximum model (MIXMAX). Based on these phoneme class decisions, the composite segment is constructed using a combination of the prior enhanced utterances. The enhancement system that generates multiple enhanced utterances is assumed to belong to the class of short-time spectral magnitude estimators which either minimizes the weighted Euclidean distortion (WED) between clean speech and clean speech estimate spectral magnitudes or which finds the joint MAP(JMAP) estimate of clean speech spectral magnitude and phase. Performance evaluations of the composite utterance exhibit better performance than the individual utterances over all phoneme classes in most cases of the noise types and SNR levels considered.
Keywords :
distortion; speech enhancement; SNR levels; broad mixture maximum model; objective quality scores; phoneme class based speech enhancement; spectral magnitude estimators; weighted Euclidean distortion; Acoustic distortion; Acoustic noise; Additive noise; Automatic speech recognition; Hidden Markov models; Maximum likelihood estimation; Phase estimation; Robustness; Speech enhancement; State estimation; MIXMAX model; joint MAP estimation; speech enhancement; weighted Euclidean distortion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495158
Filename :
5495158
Link To Document :
بازگشت