Title :
Whispered Speech Detection in Noise Using Auditory-Inspired Modulation Spectrum Features
Author :
Sarria-Paja, Milton ; Falk, Tiago H.
Author_Institution :
Inst. Nat. de la Rech. Sci. (INRS-EMT), Univ. of Quebec, Montreal, QC, Canada
Abstract :
Robustness to ambient noise, varying vocal effort, and availability of only short-duration test utterances represent big challenges for developers of automated speech-enabled applications. Recent studies have proposed the use of vocal effort-matched speaker models as a potential solution to such challenges. However, detecting whispered speech in extremely noisy environments is not a trivial task. This letter proposes the use of auditory-inspired modulation spectral-based features as a method of separating speech from environment-based components, thus resulting in accurate whispered speech detection at signal-to-noise ratios as low as 0 dB. Experimental results show the proposed detection algorithm outperforming two benchmark approaches.
Keywords :
Gaussian processes; feature extraction; modulation; speaker recognition; Gaussian mixture model; ambient noise; auditory inspired modulation spectrum features; noisy environments; short-duration test utterances; signal-to-noise ratio; speaker verification; vocal effort matched speaker models; whispered speech detection; Benchmark testing; Feature extraction; Frequency modulation; Noise; Noise measurement; Speech; Gaussian mixture model; modulation spectrum; speaker verification; speech detection; whispered speech;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2013.2266860