DocumentCode
839084
Title
Application of time-frequency principal component analysis to text-independent speaker identification
Author
Magrin-Chagnolleau, Ivan ; Durou, Geoffrey ; Bimbot, Frédéric
Author_Institution
CNRS, Lyon, France
Volume
10
Issue
6
fYear
2002
fDate
9/1/2002 12:00:00 AM
Firstpage
371
Lastpage
378
Abstract
We propose a formalism, called vector filtering of spectral trajectories, that allows the integration of a number of speech parameterization approaches (cepstral analysis, Δ and ΔΔ parameterizations, auto-regressive vector modeling, ...) under a common formalism. We then propose a new filtering, called contextual principal components (CPC) or time-frequency principal components (TFPC). This filtering consists in extracting the principal components of the contextual covariance matrix, which is the covariance matrix of a sequence of vectors expanded by their context. We apply this new filtering in the framework of closed-set speaker identification, using a subset of the POLYCOST database. When using speaker-dependent TFPC filters, our results show a relative improvement of approximately 20% compared to the use of the classical cepstral coefficients augmented by their Δ-coefficients, which is significantly better with a 90% confidence level.
Keywords
covariance matrices; filtering theory; principal component analysis; speaker recognition; time-frequency analysis; Δ parameterization; ΔΔ parameterization; Δ-coefficients; POLYCOST database; auto-regressive vector modeling; cepstral analysis; cepstral coefficients; closed-set speaker identification; confidence level; contextual covariance matrix; contextual principal components; spectral trajectories; speech parameterization; text-independent speaker identification; time-frequency principal component analysis; vector filtering; Cepstral analysis; Covariance matrix; Data mining; Filtering; Filters; Principal component analysis; Spatial databases; Speaker recognition; Speech analysis; Time frequency analysis;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/TSA.2002.800557
Filename
1040261
Link To Document