DocumentCode :
3308838
Title :
An Improved Speaker Diarization System for Multiple Distance Microphone Meetings
Author :
Yu, Zhou ; Hongbin, Suo ; Junjie, Wang ; Yonghong, Yan
Author_Institution :
Key Lab. of Speech Acoust. & Content Understanding, Beijing, China
fYear :
2012
fDate :
12-14 Jan. 2012
Firstpage :
80
Lastpage :
83
Abstract :
This paper describes an improved speaker diarization system for multiple distance microphone (MDM) meeting conversations. First, the new system includes a modified speech activity detector (SAD). Second, it adopts the new spectral features based on equivalent rectangular bandwidth (ERB) or bark scale, which are compared with the traditional Mel Frequency Cepstral Coefficients (MFCC) features. Third, the system adapts the segment model from a universal background model (UBM). Finally, it is evaluated in the NIST RT-04s MDM conditions. Experimental results show that: (1) the new speech/non-speech detector out-performs the one in the baseline system, (2) the proposed spectral features are more effective than MFCC features for speaker diarization, (3) The adaptation of segment models from UBM helps improving the system performance. Together, these improvements lead to the diarization error rate of 15.38% on RT-04s evaluation data excluding overlapping speech.
Keywords :
speaker recognition; ERB; MDM; MFCC; Mel frequency cepstral coefficients; SAD; UBM; equivalent rectangular bandwidth; improved speaker diarization system; multiple distance microphone meetings; non-speech detector; segment models; speech activity detector; speech detector; universal background model; Adaptation models; Data models; Detectors; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; ERB scale; bark scale; speaker diarization; speech activity detector; universal background model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computation Technology and Automation (ICICTA), 2012 Fifth International Conference on
Conference_Location :
Zhangjiajie, Hunan
Print_ISBN :
978-1-4673-0470-2
Type :
conf
DOI :
10.1109/ICICTA.2012.27
Filename :
6150241
Link To Document :
بازگشت