DocumentCode
1882871
Title
Higher order statistics based Gaussianity test applied to on-line speech processing
Author
Rangoussi, Maria ; Carayannis, George
Author_Institution
Dept. of Electr. Eng., Nat. Tech. Univ. of Athens, Greece
Volume
1
fYear
1994
fDate
31 Oct-2 Nov 1994
Firstpage
303
Abstract
Detection of speech in noisy recordings becomes a challenging problem when the noise does not follow the usual whiteness, stationarity and high signal-to-noise ratio assumptions. A robust speech detector can affect significantly the performance of several speech processing tasks, such as endpoint detection, segmentation, and finally recognition, if we deal with real life data, as opposed to laboratory or controlled environment recordings. The detector proposed is based on a Gaussianity test that employs third-order cumulants of the data to decide on the binary hypotheses of noise only versus speech plus noise. Speech intervals are detected by exploiting the third-order information present in the speech signal. The detector can handle a large family of additive noises, thanks to its third-order statistics basis. The sample-adaptive and decision feedback variations proposed, provide the detector with a tracking ability both with respect to the time variations of speech and the possible nonstationarity of noise. Experiments carried out using real data, recorded in a moving car interior, show satisfactory performance of the proposed algorithms down to -6 dB signal-to-noise ratio
Keywords
Gaussian processes; acoustic signal detection; adaptive signal detection; adaptive signal processing; audio recording; feedback; higher order statistics; noise; signal sampling; speech processing; tracking; 6 dB; Gaussianity test; additive noises; binary hypotheses; decision feedback variations; endpoint detection; experiments; higher order statistics; moving car interior; noise nonstationarity; noisy recordings; on-line speech processing; robust speech detector; sample-adaptive variations; signal-to-noise ratio; speech detection; speech intervals; speech plus noise; speech segmentation; third-order cumulants; third-order statistics; time variations; tracking ability; Detectors; Gaussian processes; Higher order statistics; Noise robustness; Robust control; Signal to noise ratio; Speech enhancement; Speech processing; Testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
0-8186-6405-3
Type
conf
DOI
10.1109/ACSSC.1994.471465
Filename
471465
Link To Document