• DocumentCode
    2800755
  • Title

    Adapting noisy speech models — Extended uncertainty decoding

  • Author

    Lu, Jianhua ; Ming, Ji ; Woods, Roger

  • Author_Institution
    Inst. of Electron., Commun. & Inf. Technol., Queen´´s Univ. Belfast, Belfast, UK
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4322
  • Lastpage
    4325
  • Abstract
    Most conventional techniques for noise adaptation assume a clean initial speech model which is adapted to a specific noise condition using adaptation data accumulated from the condition. In this paper, a different problem is considered, i.e. adapting a noisy speech model to a specific noise condition. For example, the initial noisy model may be a multi-condition model which is used to provide more accurate transcripts for the adaptation data than could be provided by a clean model, thereby obtaining a more accurate adaptation. We develop the formulation for this new problem by combining and extending maximum likelihood linear regression (MLLR), constrained MLLR (CMLLR) and uncertainty decoding techniques. We also present an implementation which has been tested on the Aurora 4 database, assuming an initial multi-condition model trained using white noise corrupted data. Significant word error rate (WER) reductions are achieved in comparison with other approaches.
  • Keywords
    decoding; maximum likelihood estimation; regression analysis; speech coding; Aurora 4 database; constrained maximum likelihood linear regression; extended uncertainty decoding; multicondition model; noise condition; noisy speech models; white noise corrupted data; word error rate reductions; Acoustic noise; Gaussian noise; Maximum likelihood decoding; Maximum likelihood linear regression; Noise robustness; Speech enhancement; Speech recognition; Transforms; Uncertainty; Working environment noise; Aurora 4; noise adaptation; noise compensation; noise robustness; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495660
  • Filename
    5495660