DocumentCode
1688294
Title
Stereo-based feature enhancement using dictionary learning
Author
Watanabe, Shigetaka ; Hershey, John R.
Author_Institution
Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA
fYear
2013
Firstpage
7073
Lastpage
7077
Abstract
This paper proposes stereo-based speech feature enhancement using dictionary learning. Instead of posterior values obtained by a Gaussian mixture as in other methods, we use sparse weight vectors and their variants as an alternative noisy speech feature representation. This paper also provides an efficient algorithm that can be applied to large-scale speech processing. We show the effectiveness of the proposed approach by using a middle vocabulary noisy speech recognition task based on WSJ, which was provided by the 2nd CHiME Speech Separation and Recognition Challenge.
Keywords
Gaussian processes; compressed sensing; feature extraction; learning (artificial intelligence); signal representation; speech enhancement; speech recognition; 2nd CHiME Challenge; Gaussian mixture; Speech Separation and Recognition Challenge; dictionary learning; large scale speech processing; middle vocabulary noisy speech recognition task; noisy speech feature representation; sparse weight vectors; stereo based speech feature enhancement; Bridges; 2nd CHiME challenge track 2; Speech recognition; dictionary learning; sparse representation; speech feature enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639034
Filename
6639034
Link To Document