• DocumentCode
    705199
  • Title

    A perceptually enhanced blind single-channel audio source separation by non-negative matrix factorization

  • Author

    Kirbiz, S. ; Gunsel, B.

  • Author_Institution
    Dept. of Electron. & Commun. Eng., Istanbul Tech. Univ., Istanbul, Turkey
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    731
  • Lastpage
    735
  • Abstract
    This paper proposes a 2D Non-negative Matrix Factorization (NMF) based single-channel source separation algorithm that emphasizes perceptually important components of audio. Unlike the existing methods, the proposed scheme performs a psychoacoustic pre-processing on the mixture spectrogram in order to supress audio components that are not critical to human hearing sensation while amplifying the perceptually important ones. This yields the auditory spectrogram referred as sonogram of the observed audio mixture and the individual sources are then extracted by 2D NMF. Test results reported in terms of Signal-to-Distortion-Ratio (SDR), Signal-to-Inference-Ratio (SIR) and Signal-to-Artifact-Ratio (SAR) show that the proposed perceptually enhanced separation improves the quality of decomposed audio sources by 1.5-6.5 dB with a reduced computational complexity.
  • Keywords
    audio signal processing; blind source separation; computational complexity; matrix decomposition; 2D NMF; NMF based single-channel source separation algorithm; SAR; SDR; SIR; audio component suppression; auditory spectrogram; computational complexity; human hearing sensation; mixture spectrogram; nonnegative matrix factorization; perceptually enhanced blind single-channel audio source separation; psychoacoustic pre-processing; signal-to-artifact-ratio; signal-to-distortion-ratio; signal-to-inference-ratio; sonogram; Psychoacoustic models; Psychoacoustics; Sonogram; Source separation; Spectrogram; Time-frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2010 18th European
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7096472