• DocumentCode
    2808395
  • Title

    Blind audio source separation using short+long term AR source models and spectrum matching

  • Author

    Schutz, Antony ; Slock, Dirk

  • Author_Institution
    Mobile Commun. Dept., EURECOM, Sophia Antipolis, France
  • fYear
    2011
  • fDate
    4-7 Jan. 2011
  • Firstpage
    112
  • Lastpage
    115
  • Abstract
    Blind audio source separation (BASS) arises in a number of applications in speech and music processing such as speech enhancement, speaker diarization, automated music transcription etc. Generally, BASS methods consider multichannel signal capture. The single microphone case is the most difficult underdetermined case, but it often arises in practice. In the approach considered here, the main source identifiability comes from exploiting the presumed quasi-periodic nature of the sources via long-term autoregressive (AR) modeling. Indeed, musical note signals are quasi-periodic and so is voiced speech, which constitutes the most energetic part of speech signals. We furthermore exploit (e.g. speaker or instrument related) prior information in the spectral envelope of the source signals via short-term AR modeling. We present an iterative method based on the minimization of the (weighted) Itakura-Saito distance for estimating the source parameters directly from the mixture using frame based processing.
  • Keywords
    autoregressive processes; blind source separation; iterative methods; AR source model; autoregressive modeling; blind audio source separation; iterative method; spectrum matching; speech and music processing; Correlation; Estimation; Frequency estimation; Hidden Markov models; Minimization; Source separation; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing Workshop and IEEE Signal Processing Education Workshop (DSP/SPE), 2011 IEEE
  • Conference_Location
    Sedona, AZ
  • Print_ISBN
    978-1-61284-226-4
  • Type

    conf

  • DOI
    10.1109/DSP-SPE.2011.5739196
  • Filename
    5739196