DocumentCode
1861520
Title
An investigation into subspace rapid speaker adaptation for verification
Author
Lucey, Simon ; Chen, Tsuhan
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
1
fYear
2003
fDate
6-9 July 2003
Abstract
Rapid speaker adaptation is becoming more important in emerging applications where storage, computation and training utterances are at a premium (e.g. PDAs, cell phones). Effective adaptation can be achieved for the task of speaker verification, based on a maximum a posteriori (MAP) learning framework, by restricting the client\´s parametric model to be a linear combination of parameters estimated from training observations and a speaker independent "world" model (i.e. relevance adaptation (RA)). Subspace adaptation (SA) attempts to restrict a client\´s parametric representation to a pre-defined subspace during estimation. In this paper we elucidate where subspace adaptation outperforms world adaptation, demonstrate where and why subspace adaptation is sometimes not as effective and give insights into what cost criteria should be used to construct the adaptation parametric subspace. Results are presented on the acoustic portion of the XM2VTS database for the task of Gaussian mixture model (GMM) based text-independent speaker verification.
Keywords
Gaussian processes; learning (artificial intelligence); maximum likelihood estimation; mobile communication; speaker recognition; telecommunication computing; Gaussian mixture model; XM2VTS database; adaptation parametric subspace; linear parameter combination; maximum a posteriori learning framework; mobile application; speaker independent world model; subspace rapid speaker adaptation; text-independent speaker verification; Adaptation model; Cellular phones; Costs; Hidden Markov models; Loudspeakers; Mobile computing; Parametric statistics; Personal digital assistants; Robustness; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN
0-7803-7965-9
Type
conf
DOI
10.1109/ICME.2003.1220856
Filename
1220856
Link To Document