• DocumentCode
    730333
  • Title

    A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions

  • Author

    Tawara, Naohiro ; Ogawa, Tetsuji ; Kobayashi, Tetsunori

  • Author_Institution
    Dept. of Comput. Sci., Waseda Univ., Tokyo, Japan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2041
  • Lastpage
    2045
  • Abstract
    The present paper dealt with speaker clustering for speech corrupted by noise. In general, the performance of speaker clustering significantly depends on how well the similarities between speech utterances can be measured. The recently proposed i-vector-based cosine similarity has yielded the state-of-the-art performance in speaker clustering systems. However, this similarity often fails to capture the speaker similarity under noisy conditions. Therefore, we attempted to examine the efficiency of spectral clustering on i-vector-based similarity for speech corrupted by noise because spectral clustering can yield robustness against noise by non-linear projection. Experimental comparisons demonstrated that spectral clustering yielded significant improvement from conventional methods, such as agglomerative clustering and k-means clustering, under non-stationary noise conditions.
  • Keywords
    pattern clustering; speaker recognition; i-vector-based cosine similarity; i-vector-based speaker clustering system; nonlinear projection; speaker similarity; spectral clustering; speech utterances; Accuracy; Databases; Laplace equations; Noise; Noise measurement; Noise robustness; Speech; i-vector; noise-robust speaker clustering; spectral clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178329
  • Filename
    7178329