• DocumentCode
    178920
  • Title

    Type-2 Fuzzy GMMs for Robust Text-Independent Speaker Verification in Noisy Environments

  • Author

    Pinheiro, H.N.B. ; Tsang Ing Ren ; Cavalcanti, G.D.C. ; Tsang Ing Jyh ; Sijbers, J.

  • Author_Institution
    Centro de Inf. (CIn), Univ. Fed. de Pernambuco (UFPE), Recife, Brazil
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    4531
  • Lastpage
    4536
  • Abstract
    This paper proposes the use of the type-2 fuzzy GMM (T2FGMM) framework in order to improve the verification rates of the standard GMM-UBM text-independent speaker verification system in noisy environments. Based on type-2 fuzzy sets, the T2FGMM framework describes GMMs with uncertain parameters and provides likelihood intervals for them. The proposed method (T2F-GMM-UBM) estimates the parameter intervals using the noisy speeches from the speakers and the Bayesian estimation used in the standard GMM-UBM system. The proposed method was evaluated using the MIT Device Speaker Verification Corpus (MITDSVC) which contains speeches from 48 speakers recorded in three different locations: a quiet office, a mildly noisy lobby, and a busy street intersection. The Equal Error Rate (EER) was computed for each speaker and the mean and standard deviation were analyzed. Although the proposed method did not achieve better performance in the office location, significant improvements were achieved in both lobby and street intersection locations. The improvement in the lobby was 14.21% while in the street intersection location was 10.47%. The left tailed paired Rank Sign Wilcox on Test was also performed in both locations and the p-values found were 0.0127 and 0.0230, respectively. The proposed method proved to have better performance in noisy environments compared to the standard GMM-UBM system.
  • Keywords
    Gaussian processes; mixture models; speaker recognition; text analysis; Bayesian estimation; EER; MIT device speaker verification corpus; MITDSVC; Rank Sign Wilcox; T2FGMM framework; equal error rate; noisy environments; noisy speech; robust text-independent speaker verification; street intersection location; type-2 fuzzy GMM; Adaptation models; Gaussian distribution; Mathematical model; Noise measurement; Speech; Standards; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.775
  • Filename
    6977488