DocumentCode
1858142
Title
Detection, separation and recognition of speech from continuous signals using spectral factorisation
Author
Hurmalainen, Antti ; Gemmeke, Jort F. ; Virtanen, Tuomas
Author_Institution
Tampere Univ. of Technol., Tampere, Finland
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
2649
Lastpage
2653
Abstract
In real world speech processing, the signals are often continuous and consist of momentary segments of speech over non-stationary background noise. It has been demonstrated that spectral factorisation using multi-frame atoms can be successfully employed to separate and recognise speech in adverse conditions. While in previous work full knowledge of utterance endpointing and speaker identity was used for noise modelling and speech recognition, this study proposes spectral factorisation and sparse classification techniques to detect, identify, separate and recognise speech from a continuous noisy input. Speech models are trained beforehand, but noise models are acquired adaptively from the input by using voice activity detection without prior knowledge of noise-only locations. The results are evaluated on the CHiME corpus, containing utterances from 34 speakers over highly non-stationary multi-source noise.
Keywords
matrix decomposition; speech recognition; CHiME corpus; continuous noisy input; continuous signals; highly nonstationary multisource noise; multiframe atoms; nonstationary background noise; real world speech processing; sparse classification techniques; spectral factorisation techniques; speech detection; speech models; speech momentary segments; speech recognition; speech separation; voice activity detection; Adaptation models; Noise measurement; Signal to noise ratio; Spectrogram; Speech; Speech recognition; Spectral factorization; speaker recognition; speech recognition; speech separation; voice activity detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6334329
Link To Document