Title :
Speech activity and speaker novelty detection methods for meeting processing
Author :
Sugiyama, Masahide ; Markov, Konstantin ; Ronzhin, Andrey ; Budkov, Victor ; Karpov, Alexey ; Prischepa, Maria
Author_Institution :
Human Interface Lab., Univ. of Aizu, Fukushima, Japan
Abstract :
Segmenting multi-speaker meeting audio recorded with several microphones into speech and silence frames is one of the first tasks in developing a speaker diarization system. Energy normalization techniques and signal correlation methods are used to avoid the crosstalk problem, in which a participant's speech appears on other participants' microphones. A comparison of different types of microphones and the configuration of the recording devices installed inside the intelligent meeting room are described. Special attention is paid to improving the novelty detection performance of the on-line speaker diarization system.
Keywords :
microphones; speech processing; energy normalization techniques; meeting processing; multi-speaker segmentation; speaker diarization system; speaker novelty detection methods; speech activity; Audio recording; Humans; Informatics; Laboratories; Loudspeakers; Microphones; NIST; Signal processing; Speech processing; Speech recognition; multimodal interfaces; sound source localization; speaker diarization; voice activity detection;
Conference_Titel :
Ultra Modern Telecommunications & Workshops, 2009. ICUMT '09. International Conference on
Conference_Location :
St. Petersburg
Print_ISBN :
978-1-4244-3942-3
Electronic_ISBN :
978-1-4244-3941-6
DOI :
10.1109/ICUMT.2009.5345325