Speaker identification by anchor models with PCA/LDA post-processing

Author

Mami, Yassine ; Charlet, Delphine

Author_Institution

France Telecom R&D, Lannion, France

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

Speaker representation by location is a new technique of speaker recognition and adaptation. It consists in representing a new speaker, not in an absolute manner, but relatively to a set of well trained speaker models. Each new speaker is represented by its location in an optimal representation space. This paper addresses the location task. It describes a representation space built either by clustering speakers or by selecting an optimal subset of them. In this representation space, speaker location is then performed by the anchor models technique to find vector of coordinates. An orthogonalization process is then applied to the vector of coordinates, so as to compute the distance properly. This orthogonalization process (PCA or LDA) proves experimentally to improve significantly the recognition.

Keywords

pattern clustering; speaker recognition; PCA/LDA post-processing; anchor models; clustering; optimal representation space; optimal subset; orthogonalization process; speaker identification; speaker recognition; vector of coordinates; well trained speaker models; Linear discriminant analysis; Parameter estimation; Principal component analysis; Scattering; Speaker recognition; Speech analysis; Telecommunications; Training data; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198746

Filename

1198746