DocumentCode :
3723928
Title :
Speech enhancement using Discriminative Random Fields
Author :
Ajey Saligrama; Ranjani H. G.; H. N. Shankar; R. Muralishankar
Author_Institution :
PESIT-Bangalore South Campus, India
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
Speech enhancement in stationary noise is addressed using the ideal channel selection framework. To estimate the binary mask, we propose classifying each time-frequency (T-F) bin of the noisy signal as speech or noise using Discriminative Random Fields (DRF). The DRF function contains two terms: an enhancement function and a smoothing function. On each T-F bin, we propose an enhancement function based on a likelihood ratio test for speech presence, while an Ising model serves as the smoothing function, enforcing spectro-temporal continuity in the estimated binary mask. Over successive iterations, the smoothing function is found to reduce musical noise relative to using the enhancement function alone. The binary mask is inferred from the noisy signal using the Iterated Conditional Modes (ICM) algorithm. Sentences from the NOIZEUS corpus are evaluated at Signal-to-Noise Ratios (SNR) from 0 dB to 15 dB in four additive noise settings: additive white Gaussian noise, car noise, street noise and pink noise. The speech reconstructed by the proposed technique is evaluated in terms of average segmental SNR, Perceptual Evaluation of Speech Quality (PESQ) and Mean Opinion Score (MOS).
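The inference step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the per-bin enhancement term is already available as a log-likelihood ratio array (`llr`, positive values favouring speech), uses a 4-connected Ising neighbourhood, and introduces the hypothetical names `icm_binary_mask`, `beta` and `n_iters` purely for illustration.

```python
import numpy as np


def icm_binary_mask(llr, beta=0.5, n_iters=5):
    """Estimate a binary T-F mask with Iterated Conditional Modes (ICM).

    llr     : 2-D array (frequency x time) of per-bin log-likelihood
              ratios; positive favours speech (assumed input).
    beta    : Ising coupling strength (hypothetical parameter).
    n_iters : maximum number of ICM sweeps (hypothetical parameter).
    """
    llr = np.asarray(llr, dtype=float)
    # Initialise from the per-bin term alone: +1 = speech, -1 = noise.
    labels = np.where(llr > 0, 1, -1)
    n_freq, n_time = labels.shape

    for _ in range(n_iters):
        changed = False
        for f in range(n_freq):
            for t in range(n_time):
                # Sum of 4-connected neighbour labels (fewer at borders).
                nb = 0
                if f > 0:
                    nb += labels[f - 1, t]
                if f < n_freq - 1:
                    nb += labels[f + 1, t]
                if t > 0:
                    nb += labels[f, t - 1]
                if t < n_time - 1:
                    nb += labels[f, t + 1]
                # Local score of label +1 minus that of label -1:
                # per-bin evidence plus the Ising smoothing term.
                score = llr[f, t] + 2.0 * beta * nb
                new_label = 1 if score > 0 else -1
                if new_label != labels[f, t]:
                    labels[f, t] = new_label
                    changed = True
        if not changed:
            break  # converged to a local optimum

    # Return a 0/1 mask: 1 keeps the bin, 0 discards it.
    return (labels == 1).astype(np.uint8)
```

Under these assumptions, an isolated bin with weak speech evidence surrounded by noise-labelled bins is kept when `beta=0` and removed when `beta` is large enough, which mirrors the reduction of musical noise attributed to the smoothing function.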
Keywords :
"Speech","Signal to noise ratio","Transforms","AWGN","Additives","Indexes","Noise measurement"
Publisher :
IEEE
Conference_Titel :
TENCON 2015 - 2015 IEEE Region 10 Conference
ISSN :
2159-3442
Print_ISBN :
978-1-4799-8639-2
Electronic_ISBN :
2159-3450
Type :
conf
DOI :
10.1109/TENCON.2015.7373173
Filename :
7373173