DocumentCode
1694040
Title
Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech
Author
Nouza, Jan ; Cerva, Petr ; Silovsky, Jan
Author_Institution
SpeechLab, Tech. Univ. of Liberec, Liberec, Czech Republic
fYear
2013
Firstpage
8046
Lastpage
8050
Abstract
This paper deals with the recognition of speech whose spectrum is notably distorted by lossy compression (namely MP3) or by some implementations of `speech enhancement´ techniques. We show that these non-linear treatments can introduce gaps in spectrum that significantly change the distribution of MFCCs and degrade performance of ASR. We propose a method that measures the level of spectrum distortion and use it for adding a controlled amount of noise to the signal. It effectively masks the gaps and helps namely in situations where the source and parameters of the distortion are not known and hence we cannot use a properly matched acoustic model. In spite of its simplicity, the method can improve significantly speech recognition of highly compressed or spectrally distorted signals. We demonstrate it in several large experiments conducted on publicly available speech databases, in two languages and for two types of spectral distortion.
Keywords
speech coding; speech enhancement; speech recognition; ASR; MFCC; MP3; acoustic model; compressed speech signal; nonlinear treatment; spectrally distorted signal; spectrum distortion; speech databases; speech enhancement technique; speech recognition; Abstracts; Acoustics; Face recognition; Noise; Speech; Speech coding; Speech recognition; MP3; compressed speech; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639232
Filename
6639232
Link To Document