• DocumentCode
    2633038
  • Title

    A Probabilistic Graphical Model for Microphone Array Source Separation using Rich Pre-Trained Source Models

  • Author

    Attias, H.T.

  • Author_Institution
    Golden Metallic, Inc., San Francisco, CA
  • fYear
    2006
  • fDate
    12-14 July 2006
  • Firstpage
    641
  • Lastpage
    645
  • Abstract
    Voice based computing applications, such as phone communication and speech recognition, use microphone arrays to capture voice from a human speaker. In many environments of interest, however, sounds from other sources interfere with the speaker´s voice, posing severe problems for subsequent processing. This paper describes a new framework for treating this problem, and presents and demonstrates a new algorithm for the cancellation of interfering sounds. Our framework combines techniques from statistical machine learning with ideas from speech and audio processing. An important feature involves training rich probabilistic models on data from different types of relevant sound sources. Those source models are then incorporated into a larger probabilistic model of the observed microphone data. Using that model we derive our algorithm, which is of the expectation-maximization type and infers from data the clean sound of separate individual sources. We report very good results on data recorded in different environments
  • Keywords
    acoustic signal processing; expectation-maximisation algorithm; interference suppression; learning (artificial intelligence); microphone arrays; probability; source separation; expectation-maximization; interfering sounds cancellation; microphone array source separation; probabilistic graphical model; rich pre-trained source models; sound sources; source separation; statistical machine learning; Computer applications; Graphical models; Humans; Loudspeakers; Machine learning; Machine learning algorithms; Microphone arrays; Source separation; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Sensor Array and Multichannel Processing, 2006. Fourth IEEE Workshop on
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    1-4244-0308-1
  • Type

    conf

  • DOI
    10.1109/SAM.2006.1706212
  • Filename
    1706212