• DocumentCode
    187664
  • Title

    The relevance of NIST speaker recognition evaluations

  • Author

    Asha, T. ; Murthy, Hema A.

  • Author_Institution
    Indian Inst. of Technol., Madras, Chennai, India
  • fYear
    2014
  • fDate
    22-25 July 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Feature extraction and building of the Universal Background Model (UBM) are crucial for building speaker verification/identification systems in the total variability subspace (TVS) framework. The motivation of this study is to analyze the significance of various parameters involved in front end processing for different databases. A number of different parameters like energy threshold for voice activity detection, the number of filters, the warping of the frequency scale, the number of cepstral coefficients and the shape of the filter are studied. Three different databases namely, NIST 2003, NIST 2010 and NTIMIT are studied. The optimal front-end obtained using NIST 2003 is observed to function well for NIST 2010 as conditions involving similar data was evaluated for both the databases. On the other hand, it is shown that the same optimal front-end is not scalable for NTIMIT database which is collected from a different environment. The experiments performed in this paper indicate that the optimal front-end parameters are specific to a particular dataset. In addition, mismatch between development data and evaluation data is shown to result in a poor system. Given the results, the paper questions the relevance of the NIST Speaker Recognition evaluations in real environments.
  • Keywords
    feature extraction; speaker recognition; NIST speaker recognition evaluations; NTIMIT database; TVS framework; UBM; cepstral coefficients; databases; feature extraction; speaker identification systems; speaker verification; total variability subspace; universal background model; voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications (SPCOM), 2014 International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4799-4666-2
  • Type

    conf

  • DOI
    10.1109/SPCOM.2014.6983988
  • Filename
    6983988