DocumentCode
698111
Title
Main instrument separation from stereophonic audio signals using a source/filter model
Author
Durrieu, Jean-Louis ; Ozerov, Alexey ; Fevotte, Cedric ; Richard, Gael ; David, Bertrand
Author_Institution
Inst. Telecom, Telecom ParisTech, Paris, France
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
15
Lastpage
19
Abstract
We propose a new approach to solo/accompaniment separation from stereophonic music recordings which extends a monophonic algorithm we recently proposed. The solo part is modelled using a source/filter model to which we added two contributions: an explicit smoothing strategy for the filter frequency responses and an unvoicing model to catch the stochastic parts of the solo voice. The accompaniment is modelled as a general instantaneous mixture of several components leading to a Nonnegative Matrix Factorization framework. The stereophonic signal is assumed to be the instantaneous mixture of the solo and accompaniment contributions. Both channels are then jointly used within a Maximum Likelihood framework to estimate all the parameters. Three rounds of parameter estimations are necessary to sequentially estimate the melody, the voiced part and at last the unvoiced part of the solo. Our tests show that there is a clear improvement from a monophonic reference system to the proposed stereophonic system, especially when including the unvoicing model. The smoothness of the filters does not provide the desired improvement in solo/accompaniment separation, but may be useful in future applications such as lyrics recognition. At last, our submissions to the Signal Separation Evaluation Campaign (SiSEC), for the “Professionally Produced Music Recordings” task, obtained very good results.
Keywords
audio recording; audio signal processing; matrix decomposition; maximum likelihood estimation; music; explicit smoothing strategy; filter frequency responses; filter model; main instrument separation; maximum likelihood framework; nonnegative matrix factorization framework; parameter estimations; source model; stereophonic audio signals; stereophonic music recordings; Adaptation models; Channel estimation; Hafnium; Instruments; Parameter estimation; Source separation; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077686
Link To Document