DocumentCode :
2067238
Title :
A Two-Stage Multi-Feature Integration Approach to Unsupervised Speaker Change Detection in Real-Time News Broadcasting
Author :
Xie, Lei ; Wang, Guangsen
Author_Institution :
Speech & Language Process. Group, Northwestern Polytech. Univ., China
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents a two-stage multi-feature integration approach for unsupervised speaker change detection in real-time news broadcasting. We integrate MFCC and LSP features (i.e. a perceptual feature plus a articulatory feature) in the metric-based potential speaker change detection stage to collect speaker boundary candidates as many as possible. We adopt a weighted Bayesian information criterion (BIC) to integrate boundary decisions from MFCC and LSP features in the speaker boundary confirmation stage. This multi-feature integration strategy makes use of the complementarity between perceptual features and articulatory features to achieve a performance gain. Speaker change detection experiments show that the multi- feature integration approach significantly outperforms the individual features with relative improvements of 26% over the LSP-only approach and 6% over the MFCC-only approach.
Keywords :
broadcasting; real-time systems; speaker recognition; LSP; MFCC; real-time news broadcasting; two-stage multifeature integration approach; unsupervised speaker change detection; weighted Bayesian information criterion; Acoustic signal detection; Bayesian methods; Broadcasting; Decoding; Hidden Markov models; Loudspeakers; Mel frequency cepstral coefficient; Speech processing; Speech recognition; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
Type :
conf
DOI :
10.1109/CHINSL.2008.ECP.99
Filename :
4730353
Link To Document :
بازگشت