• DocumentCode
    1848594
  • Title
    Catalog-based single-channel speech-music separation with the Itakura-Saito divergence
  • Author
    Demir, Cemil ; Cemgil, A. Taylan ; Saraclar, Murat
  • Author_Institution
    TUBITAK-BILGEM, Kocaeli, Turkey
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    2812
  • Lastpage
    2816
  • Abstract
    In this study, we introduce a catalog-based single-channel speech-music separation method that uses the Itakura-Saito (IS) divergence. Previously, we developed a catalog-based separation method based on the Kullback-Leibler (KL) divergence. From a probabilistic point of view, the IS divergence corresponds to a complex Gaussian observation model. We compare the divergence measures (or observation models) in the speech-music separation task using both catalog-based and traditional Non-Negative Matrix Factorization (NMF) methods. Separation performance is compared using the Speech-to-Music Ratio (SMR), the Speech-to-Artifact Ratio (SAR), and speech recognition performance measured by the Word Error Rate (WER). We show that using the IS divergence in both catalog-based and NMF-based speech-music separation methods yields better separation performance than the KL divergence. Moreover, catalog-based approaches with either divergence measure outperform traditional NMF-based approaches in speech recognition experiments.
  • Keywords
    speech recognition; IS divergence; Itakura-Saito divergence; KL divergence; Kullback-Leibler divergence; NMF based speech-music separation methods; NMF methods; SAR; SMR; catalog-based separation method; catalog-based single-channel speech-music separation; catalog-based speech-music separation methods; complex Gaussian observation model; nonnegative matrix factorization; probabilistic point; separation performance; speech recognition experiments; speech recognition performance; speech-to-artifact ratio; speech-to-music ratio; word error rate; Decision support systems; Europe; Mercury (metals); Noise measurement; Signal processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type
    conf
  • Filename
    6333917