• DocumentCode
    357068
  • Title

    Indexing telephone conversations by speakers using time-frequency principal component analysis

  • Author

    Magrin-Chagnolleau, Ivan ; Bimbot, Frédéric

  • Author_Institution
    IRISA, Rennes, France
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    881
  • Abstract
    We present an algorithm for the tracking of target speakers in telephone conversations. Speaker tracking consists in retrieving, in an audio recording, segments which have been uttered by a target speaker. We also compare two speech analysis techniques. The first one is the time-frequency principal component analysis. It is a new speech analysis technique based on the extraction of the principal components of the contextual covariance matrix, which is the covariance matrix of feature vectors expanded by their time context. The other one is the classical cepstral analysis. Experiments are carried out on a subset of the switchboard database
  • Keywords
    cepstral analysis; covariance matrices; database indexing; multimedia databases; principal component analysis; speaker recognition; speech processing; time-frequency analysis; audio recording; cepstral analysis; covariance matrix; experiments; feature vectors; multimedia database; speaker tracking; speech analysis; speech retrieval; switchboard database; telephone conversation indexing; time-frequency principal component analysis; Audio recording; Cepstral analysis; Covariance matrix; Indexing; Principal component analysis; Spatial databases; Speech analysis; Target tracking; Telephony; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
  • Conference_Location
    New York, NY
  • Print_ISBN
    0-7803-6536-4
  • Type

    conf

  • DOI
    10.1109/ICME.2000.871500
  • Filename
    871500