DocumentCode :
3162447
Title :
Low-latency speaker diarization based on Bayesian information criterion with multiple phoneme classes
Author :
Oku, Takanori ; Sato, Seiki ; Kobayashi, Akihiro ; Homma, S. ; Imai, Tetsuro
Author_Institution :
Sci. & Technol. Res. Labs., NHK (Japan Broadcasting Corp.), Tokyo, Japan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4189
Lastpage :
4192
Abstract :
Low-latency speaker diarization is desirable for online-oriented speaker adaptation in real-time speech recognition. Especially in spontaneous conversations, several speakers tend to speak alternatively and continuously without any silence in between utterances. We therefore propose a speaker diarization method that detects speaker-change points and determines the speaker with a fixed low latency on the basis of a Bayesian information criterion (BIC) by using acoustic features classified into multiple phoneme classes. To improve the accuracy of speaker diarization in the low latency condition, the speaker-decision is made continuously at each phoneme boundary. In an experiment on conversational broadcast news programs, our diarization method reduced the speaker diarization error rate relatively by 20.0% compared to the conventional BIC with a single phoneme class. The online speaker adaptation applied in a speech-recognition experiment reduced word error rate at speaker-change points relatively by 7.8%.
Keywords :
Bayes methods; acoustic signal processing; real-time systems; speaker recognition; Bayesian information criterion; acoustic features; low-latency speaker diarization; multiple phoneme classes; online speaker adaptation; online-oriented speaker adaptation; real-time speech recognition; speaker-change points; Acoustics; Adaptation models; Data models; Feature extraction; Real time systems; Speech; Speech recognition; BIC; phoneme classes; speaker adaptation; speaker diarization; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288842
Filename :
6288842
Link To Document :
بازگشت