DocumentCode :
2575472
Title :
Factorial Scaled Hidden Markov Model for polyphonic audio representation and source separation
Author :
Ozerov, Alexey ; Févotte, Cédric ; Charbit, Maurice
fYear :
2009
fDate :
18-21 Oct. 2009
Firstpage :
121
Lastpage :
124
Abstract :
We present a new probabilistic model for polyphonic audio termed factorial scaled hidden Markov model (FS-HMM), which generalizes several existing models, notably the Gaussian scaled mixture model and the Itakura-Saito nonnegative matrix factorization (NMF) model. We describe two expectation-maximization (EM) algorithms for maximum likelihood estimation, which differ by the choice of complete data set. The second EM algorithm, based on a reduced complete data set and multiplicative updates inspired from NMF methodology, exhibits much faster convergence. We consider the FS-HMM in different configurations for the difficult problem of speech/music separation from a single channel and report satisfying results.
Keywords :
audio signal processing; expectation-maximisation algorithm; hidden Markov models; signal representation; source separation; Gaussian scaled mixture model; Itakura-Saito nonnegative matrix factorization model; expectation-maximization algorithm; factorial scaled hidden Markov model; maximum likelihood estimation; polyphonic audio; polyphonic audio representation; source separation; speech-music separation; Acoustic signal processing; Conferences; Convergence; Hidden Markov models; Maximum likelihood estimation; Music information retrieval; Signal processing algorithms; Source separation; Speech; Telecommunications; Factorial hidden Markov model; Gaussian scaled mixture models; audio source separation; expectation-maximization algorithm; nonnegative matrix factorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
Type :
conf
DOI :
10.1109/ASPAA.2009.5346527
Filename :
5346527
Link To Document :
بازگشت