DocumentCode :
3585080
Title :
A sparsity based preprocessing for noise robust speech recognition
Author :
Koniaris, Christos ; Chatterjee, Saikat
Author_Institution :
Dept. of Philos., Univ. of Gothenburg, Gothenburg, Sweden
fYear :
2014
Firstpage :
513
Lastpage :
518
Abstract :
We show a method to sparsify the speech input that improves the robustness of an automatic speech recognizer. The proposed scheme is added to the system as a preprocessing module prior to the acoustic feature extraction. The preprocessing module passes the input speech signal through a linear predictive (LP) analysis filter and enforces sparsity in the LP residue domain. The sparsified prediction residue finally is filtered to generate the speech signal for computing a sequence of conventional feature vectors used in automatic speech recognition (ASR). Using standard feature vectors, our experiments show that sparsification in LP residue domain improves robustness in ASR performance.
Keywords :
feature extraction; filtering theory; signal denoising; speech recognition; ASR; LP analysis filter; LP residue domain; acoustic feature extraction; automatic speech recognition; feature vectors; input speech signal; linear predictive analysis filter; noise robust speech recognition; robustness improvement; sparsified prediction residue filtering; sparsity based preprocessing module; speech input sparsification; speech signal generation; standard feature vectors; Abstracts; Accuracy; Mel frequency cepstral coefficient; Psychoacoustics; Signal to noise ratio; Speech; Speech recognition; feature extraction; linear predictive analysis; residue signal; robust speech recognition; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078627
Filename :
7078627
Link To Document :
بازگشت