DocumentCode :
839084
Title :
Application of time-frequency principal component analysis to text-independent speaker identification
Author :
Magrin-Chagnolleau, Ivan ; Durou, Geoffrey ; Bimbot, Frédéric
Author_Institution :
CNRS, Lyon, France
Volume :
10
Issue :
6
fYear :
2002
fDate :
9/1/2002 12:00:00 AM
Firstpage :
371
Lastpage :
378
Abstract :
We propose a formalism, called vector filtering of spectral trajectories, that allows the integration of a number of speech parameterization approaches (cepstral analysis, Δ and ΔΔ parameterizations, auto-regressive vector modeling, ...) under a common formalism. We then propose a new filtering, called contextual principal components (CPC) or time-frequency principal components (TFPC). This filtering consists in extracting the principal components of the contextual covariance matrix, which is the covariance matrix of a sequence of vectors expanded by their context. We apply this new filtering in the framework of closed-set speaker identification, using a subset of the POLYCOST database. When using speaker-dependent TFPC filters, our results show a relative improvement of approximately 20% compared to the use of the classical cepstral coefficients augmented by their Δ-coefficients, which is significantly better with a 90% confidence level.
Keywords :
covariance matrices; filtering theory; principal component analysis; speaker recognition; time-frequency analysis; Δ parameterization; ΔΔ parameterization; Δ-coefficients; POLYCOST database; auto-regressive vector modeling; cepstral analysis; cepstral coefficients; closed-set speaker identification; confidence level; contextual covariance matrix; contextual principal components; spectral trajectories; speech parameterization; text-independent speaker identification; time-frequency principal component analysis; vector filtering; Cepstral analysis; Covariance matrix; Data mining; Filtering; Filters; Principal component analysis; Spatial databases; Speaker recognition; Speech analysis; Time frequency analysis;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.800557
Filename :
1040261
Link To Document :
بازگشت