• DocumentCode
    2270405
  • Title

    Catalog-based single-channel speech-music separation for automatic speech recognition

  • Author

    Demir, Cemil ; Cemgil, A. Taylan ; Saraclar, Murat

  • Author_Institution
    TUBITAK-BILGEM, Kocaeli, Turkey
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    2133
  • Lastpage
    2137
  • Abstract
    In this study, we analyze the effect of our previously proposed catalog-based single-channel speech-music separation method on speech recognition performance. In that method, assuming that a catalog of the background music is known, we developed a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). In this paper, we propose to recover the speech signal from the mixed signal in the time domain by detecting the active catalog frames using the catalog-based method. We compare the performance of three signal reconstruction techniques: Expectation-Based, Posterior-Based, and Time-Domain reconstruction. Moreover, we compare the performance of our system with that of the traditional NMF model. Our method outperforms the NMF method in both ASR performance and separation performance in most experimental conditions.
  • Keywords
    matrix decomposition; signal reconstruction; speech recognition; time-domain analysis; ASR performance; NMF model; PMM; catalog-based single-channel speech-music separation; automatic speech recognition; conditional Poisson mixture model; expectation-based reconstruction; music spectrograms; nonnegative matrix factorization model; posterior-based reconstruction; signal reconstruction techniques; superposed speech; time-domain; time-domain reconstruction; Catalogs; Erbium; Multiple signal classification; Spectrogram; Speech; Speech recognition; Time-domain analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074138