• DocumentCode
    394277
  • Title

    Using speech/non-speech detection to bias recognition search on noisy data

  • Author

    Beaufays, Françoise ; Boies, Daniel ; Weintraub, Mitch ; Zhu, Qifeng

  • Author_Institution
    Nuance Commun., Menlo Park, CA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper focuses on the recognition of noisy speech. We show that the decoding of a noisy speech waveform can be facilitated if the recognizer has explicit knowledge of where it should hypothesize speech phones, and where it should map the acoustics to non-speech phones. We build a speech/non-speech detector and use its output as an additional front-end feature. We show that by appropriately weighting the contribution of this feature in the decoder and by modifying the acoustic models accordingly, we can penalize speech/non-speech confusions and consequently reduce the recognition error rate. This approach gives a 12% overall error rate reduction on a wide variety of recognition tasks and noise characteristics without degrading performance on clean test data. A simple extension of the approach boosts recognition improvements on noisy test sets to 14% overall.
  • Keywords
    data compression; decoding; noise; signal detection; speech coding; speech recognition; acoustic models; bias recognition search; clean test data; front-end feature; noise characteristics; noisy data; noisy speech recognition; noisy speech waveform decoding; noisy test sets; nonspeech phones; recognition error rate reduction; speech phones; speech/nonspeech confusions; speech/nonspeech detection; Acoustic noise; Acoustic signal detection; Acoustic waves; Character recognition; Decoding; Degradation; Detectors; Error analysis; Noise reduction; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198808
  • Filename
    1198808