Title :
Indexing telephone conversations by speakers using time-frequency principal component analysis
Author :
Magrin-Chagnolleau, Ivan ; Bimbot, Frédéric
Author_Institution :
IRISA, Rennes, France
Abstract :
We present an algorithm for the tracking of target speakers in telephone conversations. Speaker tracking consists in retrieving, in an audio recording, segments which have been uttered by a target speaker. We also compare two speech analysis techniques. The first one is the time-frequency principal component analysis. It is a new speech analysis technique based on the extraction of the principal components of the contextual covariance matrix, which is the covariance matrix of feature vectors expanded by their time context. The other one is the classical cepstral analysis. Experiments are carried out on a subset of the switchboard database
Keywords :
cepstral analysis; covariance matrices; database indexing; multimedia databases; principal component analysis; speaker recognition; speech processing; time-frequency analysis; audio recording; cepstral analysis; covariance matrix; experiments; feature vectors; multimedia database; speaker tracking; speech analysis; speech retrieval; switchboard database; telephone conversation indexing; time-frequency principal component analysis; Audio recording; Cepstral analysis; Covariance matrix; Indexing; Principal component analysis; Spatial databases; Speech analysis; Target tracking; Telephony; Time frequency analysis;
Conference_Titel :
Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7803-6536-4
DOI :
10.1109/ICME.2000.871500