• DocumentCode
    1861520
  • Title

    An investigation into subspace rapid speaker adaptation for verification

  • Author

    Lucey, Simon ; Chen, Tsuhan

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    Rapid speaker adaptation is becoming more important in emerging applications where storage, computation and training utterances are at a premium (e.g. PDAs, cell phones). Effective adaptation can be achieved for the task of speaker verification, based on a maximum a posteriori (MAP) learning framework, by restricting the client's parametric model to be a linear combination of parameters estimated from training observations and a speaker independent "world" model (i.e. relevance adaptation (RA)). Subspace adaptation (SA) attempts to restrict a client's parametric representation to a pre-defined subspace during estimation. In this paper we elucidate where subspace adaptation outperforms world adaptation, demonstrate where and why subspace adaptation is sometimes not as effective and give insights into what cost criteria should be used to construct the adaptation parametric subspace. Results are presented on the acoustic portion of the XM2VTS database for the task of Gaussian mixture model (GMM) based text-independent speaker verification.
  • Keywords
    Gaussian processes; learning (artificial intelligence); maximum likelihood estimation; mobile communication; speaker recognition; telecommunication computing; Gaussian mixture model; XM2VTS database; adaptation parametric subspace; linear parameter combination; maximum a posteriori learning framework; mobile application; speaker independent world model; subspace rapid speaker adaptation; text-independent speaker verification; Adaptation model; Cellular phones; Costs; Hidden Markov models; Loudspeakers; Mobile computing; Parametric statistics; Personal digital assistants; Robustness; Speech;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1220856
  • Filename
    1220856