• DocumentCode
    1351834
  • Title

    A General Flexible Framework for the Handling of Prior Information in Audio Source Separation

  • Author

    Ozerov, Alexey ; Vincent, Emmanuel ; Bimbot, Frédéric

  • Author_Institution
    INRIA, Rennes Bretagne Atlantique, Rennes, France
  • Volume
    20
  • Issue
    4
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    1118
  • Lastpage
    1133
  • Abstract
    Most audio source separation methods are developed for a particular scenario characterized by the number of sources and channels and the characteristics of the sources and the mixing process. In this paper, we introduce a general audio source separation framework based on a library of structured source models that enable the incorporation of prior knowledge about each source via user-specifiable constraints. While this framework generalizes several existing audio source separation methods, it also allows to imagine and implement new efficient methods that were not yet reported in the literature. We first introduce the framework by describing the model structure and constraints, explaining its generality, and summarizing its algorithmic implementation using a generalized expectation-maximization algorithm. Finally, we illustrate the above-mentioned capabilities of the framework by applying it in several new and existing configurations to different source separation problems. We have released a software tool named Flexible Audio Source Separation Toolbox (FASST) implementing a baseline version of the framework in Matlab.
  • Keywords
    audio signal processing; expectation-maximisation algorithm; source separation; FASST; Matlab; above-mentioned capability; algorithmic implementation; audio source separation methods; flexible audio source separation toolbox; general audio source separation framework; general flexible framework; generalized expectation-maximization algorithm; mixing process; model constraints; model structure; prior information handling; prior knowledge; software tool; source separation problems; structured source models; user-specifiable constraints; Covariance matrix; Hidden Markov models; Libraries; Mathematical model; Source separation; Speech; Speech processing; Audio source separation; expectation–maximization; local Gaussian model; nonnegative matrix factorization;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2172425
  • Filename
    6047568