• DocumentCode
    2357815
  • Title
    On the disjointess of sources in music using different time-frequency representations
  • Author
    Giannoulis, Dimitrios ; Barchiesi, Daniele ; Klapuri, Anssi ; Plumbley, Mark D.
  • Author_Institution
    Centre for Digital Music, Queen Mary Univ. of London, London, UK
  • fYear
    2011
  • fDate
    16-19 Oct. 2011
  • Firstpage
    261
  • Lastpage
    264
  • Abstract
    This paper studies the disjointness of the time-frequency representations of simultaneously playing musical instruments. As a measure of disjointness, we use the approximate W-disjoint orthogonality proposed by Yilmaz and Rickard [1], which (loosely speaking) measures the degree of overlap of different sources in the time-frequency domain. The motivation for this study is to find a maximally disjoint representation in order to facilitate the separation and recognition of musical instruments in mixture signals. The transforms investigated in this paper include the short-time Fourier transform (STFT), the constant-Q transform, the modified discrete cosine transform (MDCT), and pitch-synchronous lapped orthogonal transforms. Simulation results are reported for a database of polyphonic music for which the multitrack data (instrument signals before mixing) were available. Absolute performance varies with the instrument source in question, but on average the MDCT with a 93 ms frame size performed best.
  • Keywords
    Fourier transforms; acoustic signal processing; discrete cosine transforms; music; musical instruments; source separation; time-frequency analysis; MDCT; STFT; W-disjoint orthogonality; constant-Q transform; disjoint representation; modified discrete cosine transform; multitrack data; musical instrument recognition; musical instrument separation; pitch-synchronous lapped orthogonal transforms; polyphonic music; short-time Fourier transform; source disjointess; time-frequency domain; time-frequency representations; Conferences; Instruments; Speech; Transforms; pitch-synchronous analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011 IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Print_ISBN
    978-1-4577-0692-9
  • Electronic_ISBN
    1931-1168
  • Type
    conf
  • DOI
    10.1109/ASPAA.2011.6082321
  • Filename
    6082321
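
The disjointness measure named in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the commonly cited form of approximate W-disjoint orthogonality, in which a binary mask keeps the time-frequency bins where the target source dominates the sum of the other sources, and the score is the masked target energy minus the masked interference energy, divided by the total target energy. The function name `approx_wdo` and the parameter `threshold_db` are hypothetical, and the inputs may be coefficients of any transform (STFT, MDCT, and so on) computed elsewhere.

```python
import numpy as np


def approx_wdo(target_tf, interference_tf, threshold_db=0.0):
    """Approximate W-disjoint orthogonality of one source (illustrative sketch).

    target_tf       -- time-frequency coefficients of the source of interest
    interference_tf -- coefficients of the sum of all other sources, same shape
    threshold_db    -- assumed dominance threshold for the binary mask (0 dB default)
    """
    # Per-bin energies; np.abs handles both real (MDCT) and complex (STFT) input.
    s = np.abs(np.asarray(target_tf)) ** 2
    y = np.abs(np.asarray(interference_tf)) ** 2
    if s.shape != y.shape:
        raise ValueError("target and interference must have the same shape")

    total = s.sum()
    if total == 0:
        raise ValueError("target has zero energy")

    # Binary mask: keep bins where the target exceeds the interference
    # by at least threshold_db (compared in the energy domain).
    mask = s > (10.0 ** (threshold_db / 10.0)) * y

    # Masked target energy minus masked interference energy, normalized
    # by the total target energy. 1.0 means perfectly disjoint sources.
    return (s[mask].sum() - y[mask].sum()) / total


if __name__ == "__main__":
    target = np.array([[2.0, 1.0]])
    others = np.array([[1.0, 2.0]])
    print(approx_wdo(target, others))
```

Under these assumptions a score near 1 indicates that the target barely overlaps the other sources in the chosen representation, and a score near 0 indicates heavy overlap, so the same function can be applied to different transforms of the same multitrack data to compare them.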