• DocumentCode
    177428
  • Title

    Bayesian analysis of similarity matrices for speaker diarization

  • Author

    Sholokhov, Alexey ; Pekhovsky, Timur ; Kudashev, Oleg ; Shulipa, Andrei ; Kinnunen, Tomi

  • Author_Institution
    Sch. of Comput., Univ. of Eastern Finland, Kuopio, Finland
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    106
  • Lastpage
    110
  • Abstract
    Inspired by recent success of speaker clustering in Total Variability space we propose a new probabilistic model for speaker diarization based on Bayesian modeling of pairwise similarity scores. The recordings are represented by symmetric similarity matrices of likelihood ratio scores from probabilistic linear discriminant analysis (PLDA) trained on short-term i-vectors. We employ Bayesian approach to address the problem of unknown number of speakers in conversation. Diarization error rates on the NIST 2008 SRE telephone data indicate comparable performance with state-of-the-art eigenvoice-based diarization. But unlike the eigenvoice approach, our method finds the number of speakers automatically, making the proposed model more viable for practical applications.
  • Keywords
    Bayes methods; pattern clustering; speaker recognition; vectors; Bayesian analysis; Bayesian modeling; NIST 2008 SRE telephone data; PLDA; diarization error rates; likelihood ratio scores; pairwise similarity scores; probabilistic linear discriminant analysis; probabilistic model; recording representation; short-term i-vectors; speaker clustering; speaker diarization; symmetric similarity matrices; total variability space; Bayes methods; Computational modeling; Data models; NIST; Probabilistic logic; Speech; Vectors; Speaker diarization; similarity matrix; variational Bayesian inference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853567
  • Filename
    6853567