Title :
Hidden Markov model-based speech enhancement using multivariate Laplace and Gaussian distributions
Author :
Aroudi, Ali ; Veisi, Hadi ; Sameti, Hossein
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
In this paper, statistical speech enhancement using hidden Markov model (HMM) is studied and new techniques for applying non-Gaussian distributions are proposed. The superiority of using non-Gaussian distributions in online adaptive noise suppression algorithms has been proven; however, in this study, this approach is formulated in an HMM-based mean-square error estimator (MMSE) estimator in which a priori models are trained in an off-line manner. In addition, an analytical study of using different distributions other than autoregressive (AR) Gaussian distribution, such as Laplace, is presented in order to construct an accurate HMM as a priori model for discrete Fourier transform and discrete cosine transform feature vectors of speech signal. In the proposed framework, an HMM-based MMSE estimator bassed on Gaussian assumption using diagonal covariance matrix is provided rather than AR hypothesis which is employed in the conventional AR-HMM-based speech enhancement algorithm. Experimental evaluations of the proposed methods are done in the presence of four different noise types at various signal-to-noise ratio levels which demonstrate the superiority of the proposed methods in most conditions in comparison with AR-HMM.
Keywords :
Gaussian distribution; Laplace transforms; autoregressive processes; covariance matrices; discrete Fourier transforms; discrete cosine transforms; hidden Markov models; mean square error methods; speech enhancement; AR Gaussian distribution hypothesis; AR-HMM-based enhancement algorithm; HMM-based MMSE estimator; HMM-based mean-square error estimator; a priori model; autoregressive Gaussian distribution assumption; diagonal covariance matrix; discrete Fourier transform; discrete cosine transform feature vectors; hidden Markov models; multivariate Laplace distribution; nonGaussian distributions; online adaptive noise suppression algorithms; signal-to-noise ratio levels; speech signal; statistical speech enhancement;
Journal_Title :
Signal Processing, IET
DOI :
10.1049/iet-spr.2014.0032