مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints

DocumentCode :

1947990

Title :

Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints

Author :

Yinan Li ; Xiongwei Zhang ; Meng Sun ; Gang Min

Author_Institution :

Lab. of Intell. Inf. Process., PLA Univ. of Sci. & Technol., Nanjing, China

fYear :

2015

fDate :

12-15 July 2015

Firstpage :

Lastpage :

Abstract :

Non-negative spectrogram decomposition and its variants have been extensively investigated for speech enhancement due to their efficiency in extracting perceptually meaningful components from mixtures. Usually, these approaches are implemented on the condition that training samples for one or more sources are available beforehand. However, in many real-world scenarios, it is always impossible for conducting any prior training. To solve this problem, we proposed an approach which directly extracts the representations of background noises from the noisy speech via imposing non-negative constraints on the low-rank and sparse decomposition of the noisy spectrogram. The noise representations are subsequently utilized when estimating the clean speech. In this technique, potential spectral structural regularity could be discovered for better reconstruction of clean speech. Evaluations on the Noisex-92 and TIMIT database showed that the proposed method achieves significant improvements over the state-of-the-art methods in unsupervised speech enhancement.

Keywords :

matrix decomposition; signal reconstruction; signal representation; source separation; spectral analysis; speech enhancement; Noisex-92 database; TIMIT database; background noise representation extraction; clean speech reconstruction; low-rank constraints; noisy speech; nonnegative constraint; nonnegative matrix factorization; nonnegative spectrogram decomposition; perceptually meaningful component extraction; robust NMF; sparse constraints; spectral structural regularity; unsupervised monaural speech enhancement; unsupervised speech enhancement; Noise; Noise measurement; Sparse matrices; Spectrogram; Speech; Speech enhancement; Time-frequency analysis; low-rank and sparse decomposition; non-negative matrix factorization; speech enhancement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on

Conference_Location :

Chengdu

Type :

conf

DOI :

10.1109/ChinaSIP.2015.7230350

Filename :

7230350

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1947990