DocumentCode :
542347
Title :
Location-based sound segregation
Author :
Roman, Nicoleta ; Wang, DeLiang ; Brown, Guy J.
Author_Institution :
Department of Computer and Information Science and Center for Cognitive Science, The Ohio State University, Columbus, 43210, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
At a cocktail party, we can selectively attend to a single voice and filter out all the other acoustical interferences. How to simulate this perceptual ability remains a great challenge. This paper describes a novel location-based approach for speech segregation. The auditory masking effect motivates the notion of an “ideal” time-frequency binary mask, which selects the target if it is stronger than the interference in a local time-frequency region. We observe that within a narrow frequency band modifications to the relative energy of the target source with respect to the interfering energy trigger systematic deviations for binaural cues. For a given spatial configuration, this interaction produces characteristic clustering in the binaural feature space. Consequently, we perform pattern classification in order to estimate ideal binary masks. A systematic evaluation shows that the resulting system produces masks very close to ideal binary ones, and large improvement over previous models.
Keywords :
Computer architecture; Ear; Estimation; Heating; Indium tin oxide; Signal to noise ratio; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743966
Filename :
5743966
Link To Document :
بازگشت