مرکز منطقه ای اطلاع رساني علوم و فناوري - Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition

DocumentCode :

1391303

Title :

Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition

Author :

Tan, Qun Feng ; Narayanan, Shrikanth S.

Author_Institution :

Dept. of Electr. Eng., Univ. of Southern California Los Angeles, Los Angeles, CA, USA

Volume :

Issue :

fYear :

2012

fDate :

5/1/2012 12:00:00 AM

Firstpage :

1337

Lastpage :

1346

Abstract :

This paper presents novel variations of group sparse regularization techniques. We expand upon the Sparse Group LASSO formulation to incorporate different learning techniques for better sparsity enforcement within a group and demonstrate the effectiveness of the algorithms for spectral denoising with applications to robust Automatic Speech Recognition (ASR). In particular, we show that with a strategic selection of groupings greater robustness to noisy speech recognition can be achieved when compared to state-of-the-art techniques like the Fast Iterative Shrinkage Thresholding Algorithm (FISTA) implementation of the Sparse Group LASSO. Moreover, we demonstrate that group sparse regularization techniques can offer significant gains over efficient techniques like the Elastic Net. We also show that the proposed algorithms are effective in exploiting collinear dictionaries to deal with the inherent highly coherent nature of speech spectral segments. Experiments on the Aurora 2.0 continuous digit database and the Aurora 3.0 realistic noisy database demonstrate the performance improvement with the proposed methods, including showing that their execution time is comparable to FISTA, making our algorithms practical for application to a wide range of regularization problems.

Keywords :

learning (artificial intelligence); performance evaluation; signal denoising; spectral analysis; speech recognition; Aurora 2.0; Aurora 3.0; Elastic Net; Sparse Group LASSO formulation; continuous digit database; group sparse regularization technique; learning technique; noise robust automatic speech recognition; noisy database; performance improvement; robust ASR; sparsity enforcement; spectral denoising; speech spectral segments; Dictionaries; Equations; Mathematical model; Noise measurement; Noise reduction; Speech recognition; Vectors; Automatic speech recognition (ASR); denoising; group sparse regularization; sparse representation;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2011.2178596

Filename :

6096392

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1391303