• DocumentCode
    2950195
  • Title

    Noise Characterization and Classification for Background Estimation

  • Author

    Maithani, Sunita ; Tyagi, Richa

  • Author_Institution
    Sci. Anal. Group Defense Res. & Dev. Organ., Delhi
  • fYear
    2008
  • fDate
    4-6 Jan. 2008
  • Firstpage
    208
  • Lastpage
    213
  • Abstract
    Characterization and classification of noise in the noisy speech is an important part of noisy speech processing in all related fields of application for speech technology. In this paper, problem of noise characterization and categorization of various environmental noises in the noisy speech of different SNR´s (Signal to Noise Ratio) has been handled. This paper purposes a technique for estimation of noise level and classifying the noises in a noisy speech into different categories of noises for background estimation. The technique detects non-speech regions in the noisy speech for SNR estimation and noise classification purpose. Three dimensional representation of noise is obtained using technique, which discriminates visual imprints of different noises in most optimum way in the 2D plane. For classification of test noise, prominent features from the 2D representation of noise are selected and classified using minimum hamming distance classifier using reference standard noise. Two dimensional second order derivatives of FFT magnitude matrix w.r.t. time and frequency are computed from the extracted pause regions. The technique is computationally simple and efficient and gives very high level of accuracy in the noisy speech with segmental SNR range of 20 dB to -20 dB. The test noises are categorized into white, babble, factory, vehicular and channel noises. The efficacy of technique developed is dependent on the limitations of the particular speech and pause detection algorithm used, particularly at moderate value of SNR. At lower value of SNR no such limitations exists and accuracy goes up to even 100%.
  • Keywords
    fast Fourier transforms; feature extraction; noise (working environment); signal classification; signal detection; signal representation; speech processing; speech recognition; FFT; SNR estimation; babble noise; background estimation; channel noise; environmental noises; factory noise; fast Fourier transform; minimum hamming distance classifier; noise characterization; noise classification; noisy speech processing; nonspeech region detection; reference standard noise; speech-pause detection algorithm; three dimensional noise representation; vehicular noise; white noise; Background noise; Frequency; Hamming distance; Noise level; Production facilities; Signal to noise ratio; Speech enhancement; Speech processing; Testing; Working environment noise; FFT (Fast Fourier Transform); LPC; LSF; MDC (Minimum Distance Classifier); SNR; Speech-Pause Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Communications and Networking, 2008. ICSCN '08. International Conference on
  • Conference_Location
    Chennai
  • Print_ISBN
    978-1-4244-1924-1
  • Electronic_ISBN
    978-1-4244-1924-1
  • Type

    conf

  • DOI
    10.1109/ICSCN.2008.4447190
  • Filename
    4447190