Title :
Benefits of prior acoustic segmentation for automatic speaker segmentation
Author :
Meignier, Sylvain ; Moraru, Daniel ; Fredouille, Corinne ; Besacier, Laurent ; Bonastre, Jean-Francois
Author_Institution :
LIA-Avignon, Avignon, France
Abstract :
The paper investigates the value of segmenting audio into acoustic macro-classes (such as gender or bandwidth) as front-end processing for the speaker segmentation/diarization task. The impact of this prior acoustic segmentation is evaluated in terms of speaker diarization performance in the context of the NIST RT'03 evaluation (conducted on the HUB4 broadcast news corpora). Although it is rarely discussed in the literature, our work shows that prior acoustic segmentation, applied in a similar way as in automatic speech recognition, may be very useful for the speaker segmentation task. Experiments were conducted using two different kinds of speaker segmentation systems, developed independently by the LIA and CLIPS laboratories in the framework of the ELISA consortium. Both systems improved when combined with prior acoustic segmentation. However, the performance gain is larger for the LIA system, which is based on an ascending/HMM approach, than for the CLIPS system, which is based on speaker turn detection.
Keywords :
acoustic signal processing; hidden Markov models; pattern classification; speaker recognition; acoustic macro classes; ascending/HMM approach; automatic speaker segmentation; automatic speech recognition; front-end processing; prior acoustic segmentation; speaker diarization; speaker identity; speaker turn detection; speech processing; Loudspeakers
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326006