DocumentCode
3585080
Title
A sparsity based preprocessing for noise robust speech recognition
Author
Koniaris, Christos ; Chatterjee, Saikat
Author_Institution
Dept. of Philos., Univ. of Gothenburg, Gothenburg, Sweden
fYear
2014
Firstpage
513
Lastpage
518
Abstract
We show a method to sparsify the speech input that improves the robustness of an automatic speech recognizer. The proposed scheme is added to the system as a preprocessing module prior to the acoustic feature extraction. The preprocessing module passes the input speech signal through a linear predictive (LP) analysis filter and enforces sparsity in the LP residue domain. The sparsified prediction residue finally is filtered to generate the speech signal for computing a sequence of conventional feature vectors used in automatic speech recognition (ASR). Using standard feature vectors, our experiments show that sparsification in LP residue domain improves robustness in ASR performance.
Keywords
feature extraction; filtering theory; signal denoising; speech recognition; ASR; LP analysis filter; LP residue domain; acoustic feature extraction; automatic speech recognition; feature vectors; input speech signal; linear predictive analysis filter; noise robust speech recognition; robustness improvement; sparsified prediction residue filtering; sparsity based preprocessing module; speech input sparsification; speech signal generation; standard feature vectors; Abstracts; Accuracy; Mel frequency cepstral coefficient; Psychoacoustics; Signal to noise ratio; Speech; Speech recognition; feature extraction; linear predictive analysis; residue signal; robust speech recognition; sparsity;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type
conf
DOI
10.1109/SLT.2014.7078627
Filename
7078627
Link To Document