• DocumentCode
    2537785
  • Title

    Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme

  • Author

    Kotti, Margarita ; Benetos, Emmanouil ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Informatics, Aristotle Univ. of Thessaloniki
  • fYear
    2006
  • fDate
    21-24 May 2006
  • Abstract
    This paper addresses unsupervised speaker change detection, a necessary step for several indexing tasks. We assume that there is no prior knowledge either on the number of speakers or their identities. Features included in the MPEG-7 audio prototype are investigated such as the AudioWaveformEnvelope and the AudioSpectrumCentroid. The model selection criterion is the Bayesian information criterion (BIC). A multiple pass algorithm is proposed. It uses a dynamic thresholding for scalar features and a fusion scheme so as to refine the segmentation results. It also models every speaker by a multivariate Gaussian probability density function and whenever new information is available, the respective model is updated. The experiments are carried out on a dataset created by concatenating speakers from the TIMIT database, that is referred to as the TIMIT data set. It is and demonstrated that the performance of the proposed multiple pass algorithm is better than that of other approaches
  • Keywords
    Gaussian processes; audio signal processing; probability; speaker recognition; Bayesian information criterion; MPEG-7 audio prototype; TIMIT database; automatic speaker change detection; dynamic thresholding; fusion scheme; model selection criterion; multiple pass algorithm; multivariate Gaussian probability density function; unsupervised speaker change detection; Artificial intelligence; Bayesian methods; Change detection algorithms; Hidden Markov models; Indexing; Informatics; Information analysis; Laboratories; MPEG 7 Standard; Maximum likelihood estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2006. ISCAS 2006. Proceedings. 2006 IEEE International Symposium on
  • Conference_Location
    Island of Kos
  • Print_ISBN
    0-7803-9389-9
  • Type

    conf

  • DOI
    10.1109/ISCAS.2006.1692970
  • Filename
    1692970