مرکز منطقه ای اطلاع رساني علوم و فناوري - On linear and mixmax interaction models for single channel source separation

DocumentCode :

3143738

Title :

On linear and mixmax interaction models for single channel source separation

Author :

Peharz, Robert ; Pernkopf, Franz

Author_Institution :

Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

249

Lastpage :

252

Abstract :

For model-based single channel source separation, one typically assumes a linear interaction model, i.e. that the mixture magnitude spectrogram is the sum of the individual source magnitude spectrograms. In the log-domain, the MIXMAX interaction model is the corresponding approximation for the linear model. Hence, one would expect similar performance for both approaches. However, in this paper we empirically show that this is not the case for vector-quantizer-based (VQ) single channel source separation. We propose factorial linear-VQ, the linear counterpart to factorial max-VQ, and compare the two methods in systematic source separation experiments. Linear-VQ performs significantly better than max-VQ for comparable code-book sizes and behaves more robustly in the presence of additive white noise. Furthermore, we compare resynthesis properties of binary and continuous time-frequency masks. While binary masks achieve a higher interference suppression, the use of continuous masks results in a consistently better signal quality.

Keywords :

approximation theory; audio signal processing; interference suppression; minimax techniques; MIXMAX interaction models; VQ based-single channel source separation; additive white noise; approximation; audio source separation; binary time-frequency masks; code-book sizes; continuous time-frequency masks; factorial linear-VQ; factorial max-VQ; individual source magnitude spectrogram; interference suppression; linear interaction models; log-domain; mixture magnitude spectrogram; model-based single channel source separation; vector-quantizer-based single channel source separation; Approximation methods; Hidden Markov models; Signal to noise ratio; Source separation; Speech; Time frequency analysis; VQ; single channel; source separation; time-frequency masking;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6287864

Filename :

6287864

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3143738