DocumentCode :
2556202
Title :
Speech activity and speaker novelty detection methods for meeting processing
Author :
Sugiyama, Masahide ; Markov, Konstantin ; Ronzhin, Andrey ; Budkov, Victor ; Karpov, Alexey ; Prischepa, Maria
Author_Institution :
Human Interface Lab., Univ. of Aizu, Fukushima, Japan
fYear :
2009
fDate :
12-14 Oct. 2009
Firstpage :
1
Lastpage :
6
Abstract :
Segmentation of multi-speaker meeting audio data recorded with several microphones into speech/silence frames is one of the first tasks at development of the speaker diarization system. Energy normalization techniques and signal correlation methods are used in order to avoid the crosstalk problem, in which participant´s speech appears on other participants´ microphones. A comparison of different types of microphones and a configuration of the recording devices implemented inside the intelligent meeting room are described. Special attention is paid to improvement of the novelty detection performance of the on-line speaker diarization system.
Keywords :
microphones; speech processing; energy normalization techniques; meeting processing; multi-speaker segmentation; speaker diarization system; speaker novelty detection methods; speech activity; Audio recording; Humans; Informatics; Laboratories; Loudspeakers; Microphones; NIST; Signal processing; Speech processing; Speech recognition; multimodal interfaces; sound source localization; speaker diarization; voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Ultra Modern Telecommunications & Workshops, 2009. ICUMT '09. International Conference on
Conference_Location :
St. Petersburg
Print_ISBN :
978-1-4244-3942-3
Electronic_ISBN :
978-1-4244-3941-6
Type :
conf
DOI :
10.1109/ICUMT.2009.5345325
Filename :
5345325
Link To Document :
بازگشت