DocumentCode :
2712182
Title :
Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals
Author :
Khanagha, Vahid ; Khanagha, Ali
Author_Institution :
Iran Univ. of Sci. & Technol., Tehran, Iran
Volume :
2
fYear :
2009
fDate :
4-6 Oct. 2009
Firstpage :
1007
Lastpage :
1011
Abstract :
Multidimensional localization of multiple sources using BSS based TDOA estimators, requires the solution of global permutation ambiguity before fusing several TDOA estimations. Since the separation quality of BSS isn´t always perfect, it is not easy to decide which TDOA belongs to which source. Here we study the possibility of using several speaker specific features of speech signal in order to recognize perceptually dominant sources in each one of moderately separated outputs of BSS algorithm. We compare the feasibility of different features in terms of validity rate of decisions and computational complexity.
Keywords :
computational complexity; direction-of-arrival estimation; speech processing; time-domain analysis; TDOA estimators; computational complexity; global permutation ambiguity; multidimensional localization; speaker specific features; speech signals; time domain BSS; validity rate; Computational complexity; Data mining; Frequency; Industrial electronics; Microphone arrays; Predictive models; Production systems; Sensor arrays; Speech processing; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Electronics & Applications, 2009. ISIEA 2009. IEEE Symposium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-4681-0
Electronic_ISBN :
978-1-4244-4683-4
Type :
conf
DOI :
10.1109/ISIEA.2009.5356310
Filename :
5356310
Link To Document :
بازگشت