• DocumentCode
    454635
  • Title

    Use of Vocal Source Features in Speaker Segmentation

  • Author

    Chan, W.N. ; Lee, Tan ; Zheng, Nengheng ; Ouyang, Hua

  • Author_Institution
    Dept. of Electron. Eng., Chinese Univ. of Hong Kong
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper addresses the problem of speaker segmentation in telephone conversations. The segmentation is done in three steps: 1) preliminary segmentation to hypothesize speaker turning points; 2) clustering of segments; and 3) re-segmentation to determine the speaker identity of each segment. It is found that vocal source related features are more speaker-discriminative than the conventional vocal tract related features when only a small amount of data is available. This motivates us to thoughtfully incorporate vocal source features into the early stages of the speaker segmentation process, where decisions have to be made with limited data. Speaker segmentation experiments are carried out on 36 summed-channel conversations from the NIST 2004 Speaker Recognition Evaluation. The proposed use of vocal source features leads to a noticeable performance improvement.
  • Keywords
    speech processing; segments clustering; speaker segmentation; telephone conversation; vocal source features; Acoustic signal detection; Audio recording; Cepstral analysis; Indexing; Loudspeakers; NIST; Speaker recognition; Speech recognition; Telephony; Turning
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660106
  • Filename
    1660106