• DocumentCode
    178005
  • Title

    An interactive audio source separation framework based on non-negative matrix factorization

  • Author

    Duong, Ngoc Q. K. ; Ozerov, Alexey ; Chevallier, Louis ; Sirot, Joel

  • Author_Institution
    Technicolor, Cesson Sévigné, France
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1567
  • Lastpage
    1571
  • Abstract
    Though audio source separation offers a wide range of applications in audio enhancement and post-production, its performance has yet to reach the satisfactory especially for single-channel mixtures with limited training data. In this paper we present a novel interactive source separation framework that allows end-users to provide feedback at each separation step so as to gradually improve the result. For this purpose, a prototype graphical user interface (GUI) is developed to help users annotating time-frequency regions where a source can be labeled as either active, inactive, or well-separated within the displayed spectrogram. This user feedback information, which is partially new with respect to the state-of-the-art annotations, is then taken into account in a proposed uncertainty-based learning algorithm to constraint the source estimates in next separation step. The considered framework is based on non-negative matrix factorization and is shown to be effective even without using any isolated training data.
  • Keywords
    audio signal processing; feedback; graphical user interfaces; learning (artificial intelligence); matrix decomposition; source separation; GUI; audio enhancement; graphical user interface; interactive audio source separation framework; nonnegative matrix factorization; time-frequency annotation; uncertainty based learning algorithm; user feedback information; Acoustics; Source separation; Spectrogram; Speech; Time-frequency analysis; Training data; Interactive audio source separation; nonnegative matrix factorization; time-frequency annotation; uncertainty-based learning; user feedback;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853861
  • Filename
    6853861