• DocumentCode
    394225
  • Title

    Speaker identification by anchor models with PCA/LDA post-processing

  • Author

    Mami, Yassine ; Charlet, Delphine

  • Author_Institution
    France Telecom R&D, Lannion, France
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Speaker representation by location is a new technique of speaker recognition and adaptation. It consists in representing a new speaker, not in an absolute manner, but relatively to a set of well trained speaker models. Each new speaker is represented by its location in an optimal representation space. This paper addresses the location task. It describes a representation space built either by clustering speakers or by selecting an optimal subset of them. In this representation space, speaker location is then performed by the anchor models technique to find vector of coordinates. An orthogonalization process is then applied to the vector of coordinates, so as to compute the distance properly. This orthogonalization process (PCA or LDA) proves experimentally to improve significantly the recognition.
  • Keywords
    pattern clustering; speaker recognition; PCA/LDA post-processing; anchor models; clustering; optimal representation space; optimal subset; orthogonalization process; speaker identification; speaker recognition; vector of coordinates; well trained speaker models; Linear discriminant analysis; Parameter estimation; Principal component analysis; Scattering; Speaker recognition; Speech analysis; Telecommunications; Training data; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198746
  • Filename
    1198746