Title :
Perceptually motivated speech enhancement based on constrained optimization
Author :
Borowicz, Adam ; Petrovsky, Alexander
Author_Institution :
Dept. of Real-Time Syst., Bialystok Tech. Univ., Bialystok, Poland
Abstract :
This paper is concerned with recently proposed perceptually constrained signal subspace (PCSS) method for speech enhancement. Optimal linear estimator minimizes average speech distortion power for a given set of the perceptually motivated constraints on the residual noise power spectrum. We also derived frequency-domain version of the PCSS method. This is done using a special assumption about structure of the covariance matrices. Surprisingly, the resulting estimator is almost identical to the well known IND (Inaudible Noise Distortion) rule. In other words we found the conditions for which an empirical IND rule is optimal. The experiments show that a intelligibility of the enhanced speech is similar for both methods, however in the case of noise attenuation the subspace-based methods are superior.
Keywords :
covariance matrices; frequency-domain analysis; optimisation; speech enhancement; constrained optimization; covariance matrices; frequency-domain version; inaudible noise distortion rule; noise attenuation; optimal linear estimator; perceptually constrained signal subspace; perceptually motivated speech enhancement; residual noise power spectrum; subspace-based methods; Covariance matrix; Estimation; Frequency domain analysis; Noise; Noise measurement; Speech; Speech enhancement;
Conference_Titel :
Signal Processing Algorithms, Architectures, Arrangements and Applications, 2007
Conference_Location :
Poznan
Print_ISBN :
978-1-4244-1514-4
Electronic_ISBN :
978-1-4244-1515-1
DOI :
10.1109/SPA.2007.5903319