• DocumentCode
    1688294
  • Title

    Stereo-based feature enhancement using dictionary learning

  • Author

    Watanabe, Shigetaka ; Hershey, John R.

  • Author_Institution
    Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA
  • fYear
    2013
  • Firstpage
    7073
  • Lastpage
    7077
  • Abstract
    This paper proposes stereo-based speech feature enhancement using dictionary learning. Instead of posterior values obtained by a Gaussian mixture as in other methods, we use sparse weight vectors and their variants as an alternative noisy speech feature representation. This paper also provides an efficient algorithm that can be applied to large-scale speech processing. We show the effectiveness of the proposed approach by using a middle vocabulary noisy speech recognition task based on WSJ, which was provided by the 2nd CHiME Speech Separation and Recognition Challenge.
  • Keywords
    Gaussian processes; compressed sensing; feature extraction; learning (artificial intelligence); signal representation; speech enhancement; speech recognition; 2nd CHiME Challenge; Gaussian mixture; Speech Separation and Recognition Challenge; dictionary learning; large scale speech processing; middle vocabulary noisy speech recognition task; noisy speech feature representation; sparse weight vectors; stereo based speech feature enhancement; Bridges; 2nd CHiME challenge track 2; Speech recognition; dictionary learning; sparse representation; speech feature enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639034
  • Filename
    6639034