Stereo-based feature enhancement using dictionary learning

Author

Watanabe, Shigetaka ; Hershey, John R.

Author_Institution

Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA

fYear

2013

Firstpage

7073

Lastpage

7077

Abstract

This paper proposes stereo-based speech feature enhancement using dictionary learning. Instead of posterior values obtained by a Gaussian mixture as in other methods, we use sparse weight vectors and their variants as an alternative noisy speech feature representation. This paper also provides an efficient algorithm that can be applied to large-scale speech processing. We show the effectiveness of the proposed approach by using a middle vocabulary noisy speech recognition task based on WSJ, which was provided by the 2nd CHiME Speech Separation and Recognition Challenge.

Keywords

Gaussian processes; compressed sensing; feature extraction; learning (artificial intelligence); signal representation; speech enhancement; speech recognition; 2nd CHiME Challenge; Gaussian mixture; Speech Separation and Recognition Challenge; dictionary learning; large scale speech processing; middle vocabulary noisy speech recognition task; noisy speech feature representation; sparse weight vectors; stereo based speech feature enhancement; Bridges; 2nd CHiME challenge track 2; Speech recognition; dictionary learning; sparse representation; speech feature enhancement;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location

Vancouver, BC

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2013.6639034

Filename

6639034