• DocumentCode
    695646
  • Title

    Sub-band spectral variance feature for noise robust ASR

  • Author

    Maganti, HariKrishna ; Zanon, Silvia ; Matassoni, Marco ; Brutti, Alessio

  • Author_Institution
    Center for Inf. Technol., Fondazione Bruno Kessler, Trento, Italy
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    2114
  • Lastpage
    2118
  • Abstract
    The goal of this work is to improve automatic speech recognition (ASR) performance in very noisy and reverberant environments. The solution is based on extracting sub-band spectral variance normalization based features, which are capable of estimating the relative strengths of speech and noise components both in presence and absence of speech. The advanced ETSI-2 frontend, RASTA-PLP, MFCC alone and in combination with spectral subtraction are tested for comparison purposes. Speech recognition evaluations are performed on the noisy standard AURORA-2 and meeting recorder digit (MRD) subset of AURORA-5 databases, which represent additive noise and reverberant acoustic conditions. The results reveal that the proposed method is robust and reliable for both low SNR and reverberant scenarios, and provide considerable improvements with respect to the traditional feature extraction techniques.
  • Keywords
    feature extraction; reverberation; speech recognition; AURORA-5 databases; MFCC; MRD subset; RASTA-PLP; additive noise; advanced ETSI-2 frontend; automatic speech recognition performance; meeting recorder digit subset; noise robust ASR; noisy environments; noisy standard AURORA-2 databases; reverberant acoustic conditions; reverberant environments; spectral subtraction; sub-band spectral variance normalization based feature extraction; Databases; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074196