• DocumentCode
    2955679
  • Title

    Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches

  • Author

    Kotti, Margarita ; Martins, Luís Gustavo P M ; Benetos, Emmanouil ; Cardoso, Jaime S. ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Informatics, Thessaloniki Univ.
  • fYear
    2006
  • fDate
    9-12 July 2006
  • Firstpage
    1101
  • Lastpage
    1104
  • Abstract
    This paper addresses the problem of unsupervised speaker change detection. Three systems based on the Bayesian information criterion (BIC) are tested. The first system investigates the AudioSpectrumCentroid and the AudioWaveformEnvelope features, implements a dynamic thresholding followed by a fusion scheme, and finally applies BIC. The second method is a real-time one that uses a metric-based approach employing the line spectral pairs and the BIC to validate a potential speaker change point. The third method consists of three modules. In the first module, a measure based on second-order statistics is used; in the second module, the Euclidean distance and T2 hotelling statistic are applied; and in the third module, the BIC is utilized. The experiments are carried out on a dataset created by concatenating speakers from the TIMIT database, that is referred to as the TIMIT data set. A comparison between the performance of the three systems is made based on t-statistics
  • Keywords
    Bayes methods; audio signal processing; sensor fusion; speaker recognition; statistical analysis; BIC; Bayesian information criterion; Euclidean distance; TIMIT database; audio spectrum centroid; audio waveform envelope feature; automatic speaker segmentation; distance measure; fusion scheme; line spectral pair; second-order statistics; speaker change detection; Bayesian methods; Databases; Euclidean distance; Hidden Markov models; Indexing; Informatics; Speech; Statistics; Streaming media; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2006 IEEE International Conference on
  • Conference_Location
    Toronto, Ont.
  • Print_ISBN
    1-4244-0366-7
  • Electronic_ISBN
    1-4244-0367-7
  • Type

    conf

  • DOI
    10.1109/ICME.2006.262727
  • Filename
    4036796