• DocumentCode
    3640863
  • Title

    Short-time Gaussianization for robust speaker verification

  • Author

    Bing Xiang;Upendra V. Chaudhari;Jiři Navrátil;Ganesh N. Ramaswamy;Ramesh A. Gopinath

  • Author_Institution
    IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    5/1/2002 12:00:00 AM
  • Abstract
    In this paper, a novel approach for robust speaker verification, namely short-time Gaussianization, is proposed. Short-time Gaussianization is initiated by a global linear transformation of the features, followed by a short-time windowed cumulative distribution function (CDF) matching. First, the linear transformation in the feature space leads to local independence or decorrelation. Then the CDF matching is applied to segments of speech localized in time and tries to warp a given feature so that its CDF matches normal distribution. It is shown that one of the recent techniques used for speaker recognition, feature warping [l] can be formulated within the framework of Gaussianization. Compared to the baseline system with cepstral mean subtraction (CMS), around 20% relative improvement in both equal error rate(EER) and minimum detection cost function (DCF) is obtained on NIST 2001 cellular phone data evaluation.
  • Keywords
    "Robustness","Random access memory"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743809
  • Filename
    5743809