مرکز منطقه ای اطلاع رساني علوم و فناوري - Efficient model-based speech separation and denoising using non-negative subspace analysis

DocumentCode :

3411899

Title :

Efficient model-based speech separation and denoising using non-negative subspace analysis

Author :

Rennie, Steven J. ; Hershey, John R. ; Olsen, Peder A.

Author_Institution :

Thomas J. Watson Res. Center, IBM, Endicott, NY

fYear :

2008

fDate :

March 31 2008-April 4 2008

Firstpage :

1833

Lastpage :

1836

Abstract :

We present a new probabilistic architecture for analyzing composite non-negative data, called Non-negative Subspace Analysis (NSA). The NSA model provides a framework for understanding the relationships between sparse subspace and mixture model based approaches, and encompasses a range of models, including Sparse Non-negative Matrix Factorization (SNMF) [1] and mixture-model based analysis as special cases. We present a convenient instantiation of the NSA model, and an efficient variational approximate learning and inference algorithm that combines the advantages of SNMF and mixture model-based approaches. Preliminary recognition results on the Pascal Speech Separation Challenge 2006 test set [2], based on NSA separation results, are presented. The results fall short of those achieved by Algonquin [3], a state-of-the-art mixture-model based method, but considering that NSA runs an order of magnitude faster, the results are impressive. NSA outperforms SNMF in terms of word error rate (WER) on the task by a significant margin of over 9% absolute.

Keywords :

matrix decomposition; signal denoising; source separation; speech processing; inference algorithm; learning algorithm; nonnegative subspace analysis; probabilistic architecture; signal denoising; sparse nonnegative matrix factorization; sparse subspace; speech separation; Data analysis; Error analysis; Inference algorithms; Iterative methods; Noise reduction; Robustness; Sparse matrices; Speech analysis; Speech recognition; Testing; Non-negative Subspace Analysis (NSA); Robust Speech Recognition; Sparse Non-negative Matrix Factorization (SNMF); Speech Separation; Variational Expectation-Maximization (GEM);

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location :

Las Vegas, NV

ISSN :

1520-6149

Print_ISBN :

978-1-4244-1483-3

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2008.4517989

Filename :

4517989

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3411899