• DocumentCode
    113153
  • Title

    Domain Mismatch Compensation for Speaker Recognition Using a Library of Whiteners

  • Author

    Singer, Elliot ; Reynolds, Douglas A.

  • Author_Institution
    MIT Lincoln Lab., Lexington, MA, USA
  • Volume
    22
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    2000
  • Lastpage
    2003
  • Abstract
    The development of the i-vector framework for generating low dimensional representations of speech utterances has led to considerable improvements in speaker recognition performance. Although these gains have been achieved in periodic National Institute of Standards and Technology (NIST) evaluations, the problem of domain mismatch, where the system development data and the application data are collected from different sources, remains a challenging one. The impact of domain mismatch was a focus of the Johns Hopkins University (JHU) 2013 speaker recognition workshop, where a domain adaptation challenge (DAC13) corpus was created to address this problem. This paper proposes an approach to domain mismatch compensation for applications where in-domain development data is assumed to be unavailable. The method is based on a generalization of data whitening used in association with i-vector length normalization and utilizes a library of whitening transforms trained at system development time using strictly out-of-domain data. The approach is evaluated on the 2013 domain adaptation challenge task and is shown to compare favorably to in-domain conventional whitening and to nuisance attribute projection (NAP) inter-dataset variability compensation.
  • Keywords
    compensation; speaker recognition; DAC13; JHU; Johns Hopkins University; NIST; National Institute of Standards and Technology; data whitening generalization; domain adaptation challenge; domain mismatch compensation; i-vector length normalization; speaker recognition; speech utterance representation; whitener library; Computational modeling; Conferences; Covariance matrices; Libraries; NIST; Speaker recognition; Speech; Channel compensation; domain mismatch; i-vectors; whitening;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2015.2451591
  • Filename
    7145413