• DocumentCode
    839084
  • Title

    Application of time-frequency principal component analysis to text-independent speaker identification

  • Author

    Magrin-Chagnolleau, Ivan ; Durou, Geoffrey ; Bimbot, Frédéric

  • Author_Institution
    CNRS, Lyon, France
  • Volume
    10
  • Issue
    6
  • fYear
    2002
  • fDate
    9/1/2002 12:00:00 AM
  • Firstpage
    371
  • Lastpage
    378
  • Abstract
    We propose a formalism, called vector filtering of spectral trajectories, that allows the integration of a number of speech parameterization approaches (cepstral analysis, Δ and ΔΔ parameterizations, auto-regressive vector modeling, ...) under a common formalism. We then propose a new filtering, called contextual principal components (CPC) or time-frequency principal components (TFPC). This filtering consists in extracting the principal components of the contextual covariance matrix, which is the covariance matrix of a sequence of vectors expanded by their context. We apply this new filtering in the framework of closed-set speaker identification, using a subset of the POLYCOST database. When using speaker-dependent TFPC filters, our results show a relative improvement of approximately 20% compared to the use of the classical cepstral coefficients augmented by their Δ-coefficients, which is significantly better with a 90% confidence level.
  • Keywords
    covariance matrices; filtering theory; principal component analysis; speaker recognition; time-frequency analysis; Δ parameterization; ΔΔ parameterization; Δ-coefficients; POLYCOST database; auto-regressive vector modeling; cepstral analysis; cepstral coefficients; closed-set speaker identification; confidence level; contextual covariance matrix; contextual principal components; spectral trajectories; speech parameterization; text-independent speaker identification; time-frequency principal component analysis; vector filtering; Cepstral analysis; Covariance matrix; Data mining; Filtering; Filters; Principal component analysis; Spatial databases; Speaker recognition; Speech analysis; Time frequency analysis;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.800557
  • Filename
    1040261