• DocumentCode
    3370502
  • Title

    Speech endpoint detection in strong noisy environment based on the Hilbert-Huang Transform

  • Author

    Lu, Zhimao ; Liu, Baisen ; Shen, Liran

  • Author_Institution
    Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
  • fYear
    2009
  • fDate
    9-12 Aug. 2009
  • Firstpage
    4322
  • Lastpage
    4326
  • Abstract
    Speech endpoint detection in strong noise environment plays an important role in speech signal processing. Hilbert-Huang Transform (HHT) is based on the local characteristics of signals, which is an adaptive and efficient transformation method. It is particularly suitable for analyzing the non-linear and non-stationary signals such as speech signal. In this paper, we chose the noisy speech signal when the signal-to-noise ratio is negative. A novel algorithm for speech endpoint detection based on Hilbert-Huang transform is provided after analyzing the noisy speech signal. The signal is first decomposed by Empirical Mode Decomposition (EMD), and partial decomposition results are processed by Hilbert transform. The threshold of noise is estimated by analyzing the front of signal´s Hilbert amplitude spectrum. The speech segments and non-speech segments can be distinguished by the threshold and the whole signal´s Hilbert amplitude spectrum. Simulation results show that the speech signal can be effective detected by this algorithm at low signal-to-noise ratio.
  • Keywords
    Hilbert transforms; speech processing; speech recognition; EMD; Hilbert Huang transform; Hilbert amplitude spectrum; adaptive transformation method; efficient transformation method; noisy speech signal; nonlinear signals; nonspeech segments; nonstationary signals; signal-to-noise ratio; speech endpoint detection; speech segments; speech signal processing; strong noisy environment; Adaptive signal processing; Algorithm design and analysis; Signal analysis; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing; Working environment noise; Empirical Mode Decomposition (EMD); Hilbert-Huang Transform (HHT); Signal detection; Voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mechatronics and Automation, 2009. ICMA 2009. International Conference on
  • Conference_Location
    Changchun
  • Print_ISBN
    978-1-4244-2692-8
  • Electronic_ISBN
    978-1-4244-2693-5
  • Type

    conf

  • DOI
    10.1109/ICMA.2009.5246577
  • Filename
    5246577