DocumentCode
3370502
Title
Speech endpoint detection in strong noisy environment based on the Hilbert-Huang Transform
Author
Lu, Zhimao ; Liu, Baisen ; Shen, Liran
Author_Institution
Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
fYear
2009
fDate
9-12 Aug. 2009
Firstpage
4322
Lastpage
4326
Abstract
Speech endpoint detection in strong noise environment plays an important role in speech signal processing. Hilbert-Huang Transform (HHT) is based on the local characteristics of signals, which is an adaptive and efficient transformation method. It is particularly suitable for analyzing the non-linear and non-stationary signals such as speech signal. In this paper, we chose the noisy speech signal when the signal-to-noise ratio is negative. A novel algorithm for speech endpoint detection based on Hilbert-Huang transform is provided after analyzing the noisy speech signal. The signal is first decomposed by Empirical Mode Decomposition (EMD), and partial decomposition results are processed by Hilbert transform. The threshold of noise is estimated by analyzing the front of signal´s Hilbert amplitude spectrum. The speech segments and non-speech segments can be distinguished by the threshold and the whole signal´s Hilbert amplitude spectrum. Simulation results show that the speech signal can be effective detected by this algorithm at low signal-to-noise ratio.
Keywords
Hilbert transforms; speech processing; speech recognition; EMD; Hilbert Huang transform; Hilbert amplitude spectrum; adaptive transformation method; efficient transformation method; noisy speech signal; nonlinear signals; nonspeech segments; nonstationary signals; signal-to-noise ratio; speech endpoint detection; speech segments; speech signal processing; strong noisy environment; Adaptive signal processing; Algorithm design and analysis; Signal analysis; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing; Working environment noise; Empirical Mode Decomposition (EMD); Hilbert-Huang Transform (HHT); Signal detection; Voice activity detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Mechatronics and Automation, 2009. ICMA 2009. International Conference on
Conference_Location
Changchun
Print_ISBN
978-1-4244-2692-8
Electronic_ISBN
978-1-4244-2693-5
Type
conf
DOI
10.1109/ICMA.2009.5246577
Filename
5246577
Link To Document