Title :
Optimal Coding of Generalized-Gaussian-Distributed Frequency Spectra for Low-Delay Audio Coder With Powered All-Pole Spectrum Estimation
Author :
Sugiura, Ryosuke ; Kamamoto, Yutaka ; Harada, Noboru ; Kameoka, Hirokazu ; Moriya, Takehiro
Author_Institution :
Nippon Telegraph & Telephone Corp., Atsugi, Japan
Abstract :
We present an optimal coding scheme that parameterizes the maximum-likelihood estimate of variance for frequency spectra belonging to the generalized Gaussian distribution, the distribution covering the Laplacian and the Gaussian. By slightly modifying the all-pole model of the conventional linear prediction (LP), we can estimate the variance with the same method as in LP, which has low computational costs. Experimental results show that incorporating the coding scheme in a state-of-the-art wide-band audio coder enhances its objective and subjective quality in a low-bit-rate and low-delay situation by increasing the compression efficiency. Thus, this coding scheme will be useful in applications like mobile communications, which requires highly efficient compression.
Keywords :
Gaussian distribution; audio coding; data compression; linear predictive coding; maximum likelihood estimation; all-pole model; all-pole spectrum estimation; compression efficiency; conventional linear prediction; generalized Gaussian distributed frequency spectra; low delay audio coder; maximum likelihood estimation; objective quality enhancement; optimal coding scheme; variance estimation; wideband audio coder; Encoding; Frequency-domain analysis; Laplace equations; Predictive models; Shape; Speech; Speech processing; Arithmetic coding; audio compression; generalized Gaussian distribution; linear prediction; low delay; transform coded excitation;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2431851