DocumentCode
1534123
Title
A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation
Author
Durrieu, Jean-Louis ; David, Bertrand ; Richard, Gaël
Author_Institution
Signal Process. Labs. (LTS5), Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
Volume
5
Issue
6
fYear
2011
Firstpage
1180
Lastpage
1191
Abstract
When designing an audio processing system, the target tasks often influence the choice of a data representation or transformation. Low-level time-frequency representations such as the short-time Fourier transform (STFT) are popular because they offer meaningful insight into sound properties at a low computational cost. Conversely, when higher-level semantics, such as pitch, timbre, or phoneme, are sought, representations tend to enhance their discriminative characteristics at the expense of their invertibility. They become so-called mid-level representations. In this paper, a source/filter signal model that provides a mid-level representation is proposed. This representation makes the pitch content of the signal as well as some timbre information available, hence keeping as much information from the raw data as possible. This model is successfully used within a main melody extraction system and a lead instrument/accompaniment separation system. Both frameworks obtained top results at several international evaluation campaigns.
Keywords
Fourier transforms; audio signal processing; music; source separation; audio processing system; low-level time-frequency representations; melody extraction system; musical audio source separation; musically motivated mid-level representation; pitch content; pitch estimation; short-time Fourier transform; Dictionaries; Estimation; Harmonic analysis; Instruments; Spectral shape; Timbre; Time frequency analysis; Audio melody extraction; audio signal representation; musical audio source separation; non-negative matrix factorization (NMF); pitch estimation
fLanguage
English
Journal_Title
Selected Topics in Signal Processing, IEEE Journal of
Publisher
IEEE
ISSN
1932-4553
Type
jour
DOI
10.1109/JSTSP.2011.2158801
Filename
5784290