• DocumentCode
    1858142
  • Title

    Detection, separation and recognition of speech from continuous signals using spectral factorisation

  • Author

    Hurmalainen, Antti ; Gemmeke, Jort F. ; Virtanen, Tuomas

  • Author_Institution
    Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    2649
  • Lastpage
    2653
  • Abstract
    In real world speech processing, the signals are often continuous and consist of momentary segments of speech over non-stationary background noise. It has been demonstrated that spectral factorisation using multi-frame atoms can be successfully employed to separate and recognise speech in adverse conditions. While in previous work full knowledge of utterance endpointing and speaker identity was used for noise modelling and speech recognition, this study proposes spectral factorisation and sparse classification techniques to detect, identify, separate and recognise speech from a continuous noisy input. Speech models are trained beforehand, but noise models are acquired adaptively from the input by using voice activity detection without prior knowledge of noise-only locations. The results are evaluated on the CHiME corpus, containing utterances from 34 speakers over highly non-stationary multi-source noise.
  • Keywords
    matrix decomposition; speech recognition; CHiME corpus; continuous noisy input; continuous signals; highly nonstationary multisource noise; multiframe atoms; nonstationary background noise; real world speech processing; sparse classification techniques; spectral factorisation techniques; speech detection; speech models; speech momentary segments; speech recognition; speech separation; voice activity detection; Adaptation models; Noise measurement; Signal to noise ratio; Spectrogram; Speech; Speech recognition; Spectral factorization; speaker recognition; speech recognition; speech separation; voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6334329