DocumentCode :
2357815
Title :
On the disjointess of sources in music using different time-frequency representations
Author :
Giannoulis, Dimitrios ; Barchiesi, Daniele ; Klapuri, Anssi ; Plumbley, Mark D.
Author_Institution :
Centre for Digital Music, Queen Mary Univ. of London, London, UK
fYear :
2011
fDate :
16-19 Oct. 2011
Firstpage :
261
Lastpage :
264
Abstract :
This paper studies the disjointness of the time-frequency representations of simultaneously playing musical instruments. As a measure of disjointness, we use the approximate W-disjoint orthogonality as proposed by Yilmaz and Rickard [1], which (loosely speaking) measures the degree of overlap of different sources in the time-frequency domain. The motivation for this study is to find a maximally disjoint representation in order to facilitate the separation and recognition of musical instruments in mixture signals. The transforms investigated in this paper include the short-time Fourier transform (STFT), constant-Q transform, modified discrete cosine transform (MDCT), and pitch-synchronous lapped orthogonal transforms. Simulation results are reported for a database of polyphonic music for which the multitrack data (instrument signals before mixing) were available. Absolute performance varies depending on the instrument source in question, but on average the MDCT with a 93 ms frame size performed best.
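The disjointness measure named in the abstract can be illustrated with a minimal sketch. It assumes the common formulation of approximate W-disjoint orthogonality: a binary mask keeps the time-frequency bins where the target source exceeds the summed interference by a threshold in dB, and the score is the target energy kept by the mask minus the interference energy leaked through it, normalized by the total target energy. The function name `approximate_wdo`, its parameters, and the 0 dB default threshold are illustrative assumptions. They are not taken from the paper, whose exact settings are not given in this record.

```python
import numpy as np


def approximate_wdo(target_tf, interference_tf, threshold_db=0.0):
    """Approximate W-disjoint orthogonality of one source against the rest.

    target_tf       -- time-frequency coefficients of the target source
                       (for example STFT or MDCT coefficients)
    interference_tf -- coefficients of the sum of all other sources,
                       same shape as target_tf
    threshold_db    -- hypothetical mask threshold in dB (assumption)
    """
    target_power = np.abs(np.asarray(target_tf)) ** 2
    interference_power = np.abs(np.asarray(interference_tf)) ** 2
    if target_power.shape != interference_power.shape:
        raise ValueError("inputs must have the same shape")

    total_target = target_power.sum()
    if total_target == 0:
        raise ValueError("target source has no energy")

    # Binary mask: keep bins where the target dominates the interference
    # by at least threshold_db (expressed here as a power ratio).
    power_ratio = 10.0 ** (threshold_db / 10.0)
    mask = (target_power >= power_ratio * interference_power) & (target_power > 0)

    kept_target = target_power[mask].sum()            # target energy preserved
    leaked_interference = interference_power[mask].sum()  # interference let through

    # Equals PSR - PSR / SIR, written in a form that avoids dividing by zero.
    return (kept_target - leaked_interference) / total_target
```

A score of 1 means the target occupies bins that no other source touches, while a score near 0 means the mask cannot keep the target without admitting comparable interference energy. Under these assumptions, comparing transforms amounts to computing this score for each source with each representation and averaging.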
Keywords :
Fourier transforms; acoustic signal processing; discrete cosine transforms; music; musical instruments; source separation; time-frequency analysis; MDCT; STFT; W-disjoint orthogonality; constant-Q transform; disjoint representation; modified discrete cosine transform; multitrack data; musical instrument recognition; musical instrument separation; pitch-synchronous lapped orthogonal transforms; polyphonic music; short-time Fourier transform; source disjointess; time-frequency domain; time-frequency representations; Conferences; Instruments; Speech; Transforms; pitch-synchronous analysis
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Print_ISBN :
978-1-4577-0692-9
Electronic_ISBN :
1931-1168
Type :
conf
DOI :
10.1109/ASPAA.2011.6082321
Filename :
6082321