• DocumentCode
    3071189
  • Title

    Robust Speaker Diarization in a Multi-Speaker Environment Using Autocorrelation-based Noise Subtraction

  • Author

    Mirrezaie, S.M. ; Ahadi, S.M. ; Kashi, A.

  • Author_Institution
    Amirkabir Univ. of Technol., Tehran
  • fYear
    2007
  • fDate
    15-18 Dec. 2007
  • Firstpage
    291
  • Lastpage
    296
  • Abstract
    This paper presents research on speaker diarization in multi-speaker environments. It examines the algorithms and implementation of an offline speaker segmentation and indexing system for recorded speech data in which more than one speaker is usually present. Speaker diarization is a well-studied topic in the domain of broadcast news recordings. Most of the proposed systems involve hierarchical clustering of the data, where the number of speakers and their identities are known a priori. Speaker diarization is the task of assigning a unique label to all speech segments in an audio stream that are spoken by the same speaker. There are two key challenges: processing speed and robustness in the presence of noise. In this paper, we address the robustness issue by using a method that has already been successful in speech recognition applications. Using ANS (Autocorrelation-Based Noise Subtraction) for robust genetic algorithm-based speaker diarization, we compare the results with the baseline MFCC-based system in clean and noisy conditions.
  • Keywords
    correlation methods; genetic algorithms; pattern clustering; speaker recognition; speech processing; autocorrelation-based noise subtraction; broadcast news recording; data clustering; genetic algorithm; indexing system; multispeaker environment; offline speaker segmentation; speaker diarization; speech recognition; Audio recording; Autocorrelation; Broadcasting; Clustering algorithms; Genetics; Indexing; Noise robustness; Speech recognition; Streaming media; Working environment noise; Robust speaker diarization; meetings indexing; noisy speech; speaker segmentation and clustering;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Information Technology, 2007 IEEE International Symposium on
  • Conference_Location
    Giza
  • Print_ISBN
    978-1-4244-1835-0
  • Electronic_ISBN
    978-1-4244-1835-0
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2007.4458171
  • Filename
    4458171