Title :
Speaker indexing and speech enhancement in real meetings / conversations
Author :
Araki, Shoko ; Fujimoto, Masakiyo ; Ishizuka, Kentaro ; Sawada, Hiroshi ; Makino, Shoji
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto
fDate :
March 31 2008-April 4 2008
Abstract :
This paper presents a speaker indexing method that uses a small number of microphones to estimate who spoke when. Our proposed speaker indexing is realized by using a noise robust voice activity detector (VAD), a QCC-PHAT based direction of arrival (DOA) estimator, and a DOA classifier. Using the estimated speaker indexing information, we can also enhance the utterances of each speaker with a maximum signal-to-noise-ratio (MaxSNR) beamformer. This paper applies our system to real recorded meetings / conversations recorded in a room with a reverberation time of 350 ms, and evaluates the performance by a standard measure: the diarization error rate (DER). Even for the real conversations, which have many speaker turn-takings and overlaps, the speaker error time was very small with our proposed system. We are planning to demonstrate a real-time speaker indexing system at ICASSP2008.
Keywords :
direction-of-arrival estimation; indexing; reverberation; signal classification; speaker recognition; speech enhancement; DOA classifier; conversation recording; diarization error rate; direction of arrival estimator; microphones; noise robust voice activity detector; real recorded meetings; reverberation time; speaker indexing; speech enhancement; time 350 ms; Detectors; Direction of arrival estimation; Error analysis; Indexing; Measurement standards; Microphones; Noise robustness; Reverberation; Signal to noise ratio; Speech enhancement; Speaker indexing; diarization; maximum SNR beamformer; voice activity detector;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517554