• DocumentCode
    2799316
  • Title

    Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition

  • Author

    Seltzer, Michael L. ; Acero, Alex ; Kalgaonkar, Kaustubh

  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4550
  • Lastpage
    4553
  • Abstract
    We recently proposed a new algorithm to perform acoustic model adaptation to noisy environments called Linear Spline Interpolation (LSI). In this method, the nonlinear relationship between clean and noisy speech features is modeled using linear spline regression. Linear spline parameters that minimize the error the between the predicted noisy features and the actual noisy features are learned from training data. A variance associated with each spline segment captures the uncertainty in the assumed model. In this work, we extend the LSI algorithm in two ways. First, the adaptation scheme is extended to compensate for the presence of linear channel distortion. Second, we show how the noise and channel parameters can be updated during decoding in an unsupervised manner within the LSI framework. Using LSI, we obtain an average relative improvement in word error rate of 10.8% over VTS adaptation on the Aurora 2 task with improvements of 15-18% at SNRs between 10 and 15 dB.
  • Keywords
    feature extraction; interpolation; regression analysis; signal denoising; speech recognition; LSI algorithm; acoustic model adaptation; channel parameter; error minimization; linear channel distortion; linear spline interpolation; noise parameter; noisy environment; noisy speech feature; robust speech recognition; Acoustic noise; Adaptation model; Interpolation; Large scale integration; Robustness; Speech recognition; Spline; Training data; Uncertainty; Working environment noise; model adaptation; robust speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495581
  • Filename
    5495581