Title :
Stereo-based feature enhancement using dictionary learning
Author :
Watanabe, Shigetaka ; Hershey, John R.
Author_Institution :
Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA
Abstract :
This paper proposes stereo-based speech feature enhancement using dictionary learning. Instead of posterior values obtained by a Gaussian mixture as in other methods, we use sparse weight vectors and their variants as an alternative noisy speech feature representation. This paper also provides an efficient algorithm that can be applied to large-scale speech processing. We show the effectiveness of the proposed approach by using a middle vocabulary noisy speech recognition task based on WSJ, which was provided by the 2nd CHiME Speech Separation and Recognition Challenge.
Keywords :
Gaussian processes; compressed sensing; feature extraction; learning (artificial intelligence); signal representation; speech enhancement; speech recognition; 2nd CHiME Challenge; Gaussian mixture; Speech Separation and Recognition Challenge; dictionary learning; large scale speech processing; middle vocabulary noisy speech recognition task; noisy speech feature representation; sparse weight vectors; stereo based speech feature enhancement; Bridges; 2nd CHiME challenge track 2; Speech recognition; dictionary learning; sparse representation; speech feature enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639034