• DocumentCode
    730083
  • Title

    Informed monaural source separation of music based on convolutional sparse coding

  • Author

    Ping-Keng Jao ; Yi-Hsuan Yang ; Wohlberg, Brendt

  • Author_Institution
    Res. Center for Inf. Technol. Innovation, Taipei, Taiwan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    236
  • Lastpage
    240
  • Abstract
    Monaural source separation is a challenging problem that has many important applications in music information retrieval. In this paper, we focus on the score-informed variant of this problem. While non-negative matrix factorization and some other approaches have been shown effective, few existing approaches have properly taken the phase information into account. There are unnatural sound in the separation result, as the phase of each source signal is considered equivalent to the phase of the mixed signal. To remedy this, we propose to perform source separation directly in the time domain using a convolutional sparse coding (CSC) approach. Evaluation on the Bach10 dataset shows that, when the instrument, pitch and onset/offset time are informed, the source to distortion ratio of the separation result reaches 8.59 dB, which is 2.02 dB higher than a state-of-the-art system called Soundprism.
  • Keywords
    acoustic signal processing; audio coding; convolutional codes; musical acoustics; time-domain analysis; Soundprism; convolutional sparse coding; monaural source separation; music; nonnegative matrix factorization; sound source signal; time domain; Convolution; Convolutional codes; Dictionaries; Instruments; Source separation; Speech; Time-domain analysis; Convolutional sparse coding; dictionary learning; score-informed monaural source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177967
  • Filename
    7177967