• DocumentCode
    417268
  • Title

    A new voice activity detector using subband order-statistics filters for robust speech recognition

  • Author

    Ramírez, J. ; Segura, J.C. ; Benirez, C. ; La Torre, A. De ; Rubio, A.

  • Author_Institution
    Dept. de Electron. y Tecnologia de Computadores, Granada Univ., Spain
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Currently, there are technology barriers inhibiting speech processing systems working under extreme noisy conditions. The emerging applications of speech technology, especially in the fields of wireless communications, digital hearing aids or speech recognition, are some examples of such systems often requiring a noise reduction technique in combination with a precise voice activity detector (VAD). This paper presents a new VAD for improving speech detection robustness in noisy environments and the performance of speech recognition systems. The algorithm uses long-term information about the speech signal to formulate the decision rule and estimates the subband SNR using specialized order statistics filters (OSF). The proposed algorithm is compared to the most commonly used VAD in the field, in terms of speech/nonspeech discrimination and also in terms of recognition performance when the VAD is used in an automatic speech recognition (ASR) system. Experimental results demonstrate a sustained advantage over different VAD methods including standard VAD such as G.729 and AMR which are used as a reference, the VAD of the Advanced Front-End (AFE) for distributed speech recognition (DSR), and recently reported algorithms.
  • Keywords
    decision trees; parameter estimation; signal detection; speech recognition; ASR system; VAD; automatic speech recognition; decision rule; noisy environments; performance; robust speech recognition; speech detection robustness; speech processing; subband SNR estimation; subband order-statistics filters; voice activity detector; Acoustical engineering; Automatic speech recognition; Detectors; Filters; Hearing aids; Noise robustness; Speech enhancement; Speech processing; Speech recognition; Wireless communication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326119
  • Filename
    1326119