DocumentCode :
699283
Title :
Voice separation of overlapping speech using tracking techniques and the gating process
Author :
Potamitis, Ilyas ; Zervas, Panos ; Fakotakis, Nikos
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Patras, Patras, Greece
fYear :
2004
fDate :
6-10 Sept. 2004
Firstpage :
1119
Lastpage :
1122
Abstract :
This paper investigates the use of tracking techniques, previously applied successfully to aircraft tracking and navigation, to segment possibly overlapping speech of multiple static speakers in an enclosure. The tracking technique applied, namely probabilistic data association (PDA) in conjunction with the interacting multiple model (IMM) estimator, directly accounts for measurement-origin uncertainty, i.e., which direction-of-arrival (DOA) measurement comes from which speaker, and rejects spurious DOAs. The estimated DOAs are utilized by a single microphone array to provide separation through its directional receptive field. Based on the prediction of the IMM filter, which constructs permissible DOA regions (gates) for each speaker, we elaborate on the concept and application of the so-called "gating process", which can be utilized in the initialization and termination of speech tracks, thus serving as a voice activity detector (VAD). The effectiveness of the approach is illustrated by an extensive simulation study on tracking and separating three static speakers having a conversation with partially overlapping speech and long pauses.
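The gating idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a scalar DOA in degrees, a fixed innovation variance, and a chi-square-style gate threshold, and it replaces the IMM-PDA predictor with hand-supplied predicted DOAs. The names `in_gate`, `gate_measurements`, `active_speakers`, `innovation_var`, and `gate_threshold` are hypothetical.

```python
def in_gate(measured_doa, predicted_doa, innovation_var, gate_threshold=9.0):
    """Return True if a DOA measurement lies inside a speaker's gate.

    Assumption: the gate is the region where the squared, variance-normalized
    angular difference does not exceed gate_threshold (9.0 ~ a 3-sigma gate).
    """
    # Wrap the angular difference into [-180, 180) degrees.
    diff = (measured_doa - predicted_doa + 180.0) % 360.0 - 180.0
    return diff * diff / innovation_var <= gate_threshold


def gate_measurements(measurements, predictions, innovation_var, gate_threshold=9.0):
    """Map each speaker to the DOA measurements that fall inside its gate.

    Measurements that fall in no gate are treated as spurious and dropped.
    """
    return {
        speaker: [z for z in measurements
                  if in_gate(z, predicted, innovation_var, gate_threshold)]
        for speaker, predicted in predictions.items()
    }


def active_speakers(gated):
    """Crude VAD stand-in: a speaker is active if its gate holds a measurement."""
    return {speaker for speaker, hits in gated.items() if hits}


# Hypothetical predicted DOAs (degrees) for three static speakers.
predictions = {"A": -40.0, "B": 0.0, "C": 45.0}
# Hypothetical DOA measurements for one frame; 120.0 plays the spurious one.
measurements = [-38.5, 44.0, 120.0]

gated = gate_measurements(measurements, predictions, innovation_var=4.0)
print(gated)
print(active_speakers(gated))
```

In this frame speaker B's gate is empty, which a tracker could count toward terminating that speech track, while a run of frames with non-empty gates could be used to initiate one.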
Keywords :
aircraft navigation; direction-of-arrival estimation; speech processing; tracking; DOA measurement; IMM estimator; IMM filter prediction; PDA; VAD; aircraft tracking; direction-of-arrival measurement; directional receptive field; gating process; interacting multiple model; multiple-static speakers; overlapping speech segmentation; partially-overlapping speech; probabilistic data association; single-microphone array; speech track initialization; speech track termination; static speaker separation; static speaker tracking; tracking technique; voice activity detector; voice separation; Abstracts; Robustness; Wideband
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7
Type :
conf
Filename :
7079813