DocumentCode
2633038
Title
A Probabilistic Graphical Model for Microphone Array Source Separation using Rich Pre-Trained Source Models
Author
Attias, H.T.
Author_Institution
Golden Metallic, Inc., San Francisco, CA
fYear
2006
fDate
12-14 July 2006
Firstpage
641
Lastpage
645
Abstract
Voice based computing applications, such as phone communication and speech recognition, use microphone arrays to capture voice from a human speaker. In many environments of interest, however, sounds from other sources interfere with the speaker´s voice, posing severe problems for subsequent processing. This paper describes a new framework for treating this problem, and presents and demonstrates a new algorithm for the cancellation of interfering sounds. Our framework combines techniques from statistical machine learning with ideas from speech and audio processing. An important feature involves training rich probabilistic models on data from different types of relevant sound sources. Those source models are then incorporated into a larger probabilistic model of the observed microphone data. Using that model we derive our algorithm, which is of the expectation-maximization type and infers from data the clean sound of separate individual sources. We report very good results on data recorded in different environments
Keywords
acoustic signal processing; expectation-maximisation algorithm; interference suppression; learning (artificial intelligence); microphone arrays; probability; source separation; expectation-maximization; interfering sounds cancellation; microphone array source separation; probabilistic graphical model; rich pre-trained source models; sound sources; source separation; statistical machine learning; Computer applications; Graphical models; Humans; Loudspeakers; Machine learning; Machine learning algorithms; Microphone arrays; Source separation; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Sensor Array and Multichannel Processing, 2006. Fourth IEEE Workshop on
Conference_Location
Waltham, MA
Print_ISBN
1-4244-0308-1
Type
conf
DOI
10.1109/SAM.2006.1706212
Filename
1706212
Link To Document