• DocumentCode
    2876135
  • Title

    Recent advances in PD-MEMLIN for speech recognition in car conditions

  • Author

    Buera, Luis ; Lleida, Eduardo ; Miguel, Antonio ; Ortega, Alfonso

  • Author_Institution
    Aragon Inst. of Eng. Res., Zaragoza Univ.
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    180
  • Lastpage
    185
  • Abstract
    In a previous work, phoneme-dependent multi-environment models based linear normalization, PD-MEMLIN, was presented and it was proved to be effective to compensate environment mismatch. Since PD-MEMLIN transformations have to be estimated from stereo data corpora, and the computational cost is high, two approaches are proposed: coefficient progressive PD-MEMLIN, CPPD-MEMLIN, and blind PD-MEMLIN. The first one consists on a partial normalization of the feature vector, reducing the computational cost, while blind PD-MEMLIN can be applied over any non stereo data corpora, thus the estimation of the transformation is based on an iterative technique from noisy data and a target clean speech model. Some experiments with SpeechDat car database were carried out in order to study the behavior of the proposed techniques in a real acoustic environment. In the previous work, PD-MEMLIN with stereo data and normalizing 13 MFCC coefficients reached 77.67% of improvement. In this paper, CPPD-MEMLEM with only 4 coefficients obtains an average improvement of 72.40%, and blind PD-MEMLIN obtains an average improvement of 73.96%
  • Keywords
    automotive engineering; speech processing; speech recognition; SpeechDat car database; car conditions; clean speech model; coefficient progressive PD-MEMLIN; linear normalization; phoneme-dependent multi-environment models; speech recognition; stereo data corpora; Acoustic noise; Communications technology; Computational efficiency; Gaussian processes; Mel frequency cepstral coefficient; Noise reduction; Spatial databases; Speech recognition; Vectors; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566542
  • Filename
    1566542