Title :
Speech probability distribution
Author :
Gazor, Saeed ; Zhang, Wei
Author_Institution :
Dept. of Electr. & Comput. Eng., Queen´´s Univ., Canada
fDate :
7/1/2003 12:00:00 AM
Abstract :
It is demonstrated that the distribution of speech samples is well described by Laplacian distribution (LD). The widely known speech distributions, i.e., LD, Gaussian distribution (GD), generalized GD, and gamma distribution, are tested as four hypotheses, and it is proved that speech samples during voice activity intervals are Laplacian random variables. A decorrelation transformation is then applied to speech samples to approximate their multivariate distribution. To do this, speech is decomposed using an adaptive Karhunen-Loeve transform or a discrete cosine transform. Then, the distributions of speech components in decorrelated domains are investigated. Experimental evaluations prove that the statistics of speech signals are like a multivariate LD. All marginal distributions of speech are accurately described by LD in decorrelated domains. While the energies of speech components are time-varying, their distribution shape remains Laplacian.
Keywords :
Gaussian distribution; Karhunen-Loeve transforms; decorrelation; discrete cosine transforms; gamma distribution; signal sampling; speech processing; DCT; Gaussian distribution; Laplacian distribution; Laplacian random variables; adaptive Karhunen-Loeve transform; decorrelated domains; decorrelation transformation; discrete cosine transform; gamma distribution; generalized GD; marginal distributions; multivariate distribution approximation; speech probability distribution; speech samples distribution; speech signal statistics; time-varying speech components energy; Decorrelation; Discrete cosine transforms; Gaussian distribution; Karhunen-Loeve transforms; Laplace equations; Probability distribution; Random variables; Speech analysis; Statistical distributions; Testing;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2003.813679