Title :
Nuts and Flakes: a Study of Data Characteristics in Speaker Diarization
Author :
Mirghafori, Nikki ; Wooters, Chuck
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA
Abstract :
Researchers in the speaker diarization community have observed that some audio files show unusually high diarization error rates (DER) (hard to crack "nuts"), and some exhibit hyper-sensitivity to tuning parameters ("flakes"). The goal of this study is to systematically study the features that correlate with such behavior. We calculated over forty features for each of 24 shows from the broadcast news corpus along the dimensions of speaker count, conversation turn, and speaker and show duration. We observed that number of speakers, number of turns, and do-nothing DER (a measure related to the percentage of time the dominant speaker spoke) correlated best with "nuttiness". The do-nothing DER and number of speakers were also the best correlates of "flakiness"
Keywords :
speech processing; broadcast news corpus; data characteristics; diarization error rates; flakes; nuts; speaker diarization; Audio recording; Broadcasting; Cellular neural networks; Computer science; Contracts; Density estimation robust algorithm; Error analysis; NIST; Optimal matching; Time measurement;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660196