DocumentCode :
3518021
Title :
Target speech extractionwith learned spectral bases
Author :
Park, Sunho ; Yoo, Jiho ; Choi, Seungjin
Author_Institution :
Dept. of Comput. Sci., POSTECH, Pohang
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
1789
Lastpage :
1792
Abstract :
In this paper we present a method for extracting a speech signal of target speaker from noisy convolutive mixtures of target speech and an interference source, when training utterances of the target speaker are available. We incorporate a statistical latent variable model into blind source separation (BSS), where we make use of spectral bases learned from the training utterances of the target speaker to identify which source corresponds to the target speaker. Combined with any existing BSS methods, our post-processing (which is the main contribution) consists of two steps: (1) channel selection where we identify the source corresponding to the target speaker; (2) enhancement where we further suppress the remaining interference. Numerical experiments confirm that our method substantially improves the separation quality of existing BSS methods and successfully restores the target speaker´s speech.
Keywords :
blind source separation; convolution; feature extraction; interference (signal); speaker recognition; spectral analysis; statistical analysis; blind source separation; channel selection; interference source; learned spectral bases; noisy convolutive mixtures; speech extraction; statistical latent variable model; target speaker utterances; Acoustic noise; Automatic speech recognition; Background noise; Blind source separation; Data mining; Interference suppression; Loudspeakers; Source separation; Speech enhancement; Working environment noise; Blind source separation; speech extraction; speech segregation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959952
Filename :
4959952
Link To Document :
بازگشت