Title :
An alternating ℓp — ℓ2 projections algorithm (ALPA) for speech modeling using sparsity constraints
Author :
Adiga, Aniruddha ; Seelamantula, Chandra Sekhar
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci. Bangalore, Bangalore, India
Abstract :
We address the problem of separating a speech signal into its excitation and vocal-tract filter components, which falls within the framework of blind deconvolution. Typically, the excitation in case of voiced speech is assumed to be sparse and the vocal-tract filter stable. We develop an alternating ℓp - ℓ2 projections algorithm (ALPA) to perform deconvolution taking into account these constraints. The algorithm is iterative, and alternates between two solution spaces. The initialization is based on the standard linear prediction decomposition of a speech signal into an autoregressive filter and prediction residue. In every iteration, a sparse excitation is estimated by optimizing an ℓp-norm-based cost and the vocal-tract filter is derived as a solution to a standard least-squares minimization problem. We validate the algorithm on voiced segments of natural speech signals and show applications to epoch estimation. We also present comparisons with state-of-the-art techniques and show that ALPA gives a sparser impulse-like excitation, where the impulses directly denote the epochs or instants of significant excitation.
Keywords :
deconvolution; filtering theory; iterative methods; least squares approximations; minimisation; regression analysis; speech processing; ℓp-norm-based cost; ALPA; alternating ℓp-ℓ2 projections algorithm; autoregressive filter; blind deconvolution; epoch estimation; excitation components; impulse-like excitation; iterative algorithm; natural speech signals; prediction residue; solution spaces; speech modeling; standard least-squares minimization problem; standard linear prediction decomposition; vocal-tract filter components; voiced segments; voiced speech; Deconvolution; Digital signal processing; Estimation; Signal processing algorithms; Signal to noise ratio; Speech; Standards; Deconvolution; Iteratively reweighted least-squares (IRLS) technique; Linear prediction; Sparsity constraints;
Conference_Titel :
Digital Signal Processing (DSP), 2014 19th International Conference on
Conference_Location :
Hong Kong
DOI :
10.1109/ICDSP.2014.6900673