• DocumentCode
    258689
  • Title

    Towards improving the performance of speaker recognition systems

  • Author

    Johnson, Neethu ; George, Kuruvachan K. ; Kumar, C. Santhosh ; Raj, P. C. Reghu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Gov. Eng. Coll., Palakkad, India
  • fYear
    2014
  • fDate
    17-18 Dec. 2014
  • Firstpage
    38
  • Lastpage
    41
  • Abstract
    This paper studies the contribution of different phones in speech data towards improving the performance of text/language independent speaker recognition systems. This work is motivated by the fact that the removal of silence segments from the speech data improves the system performance significantly as it does not contain any speaker-specific information. It is also clear from the literature that not all the phones in the speech data contains equal amount of speaker-specific information in it and the performance of the speaker recognition systems depends on this information. In addition to the silence segments, our work empirically finds 18 other diluent phones that has minimum speaker discrimination capability. We propose to use a preprocessing stage that identifies all non-informative set of phones recursively and removes them along with silence segments. Results show that using phones removed preprocessed data in state-of-the-art i-vector system outperforms the baseline i-vector system. We report absolute improvements of 1%, 1%, 2%, 2% and 1% in EER for test set collected through channels of Digital Voice Recorder, Headset, Mobile Phone 1, Mobile Phone 2 and Tablet PC respectively on IITG-MV database.
  • Keywords
    speaker recognition; IITG-MV database; digital voice recorder; diluent phones; i-vector system; silence segments; speaker discrimination capability; speaker recognition systems performance; speaker-specific information; speech data; text/language independent speaker recognition systems; Data models; Databases; Feature extraction; Sensors; Speaker recognition; Speech; Speech recognition; Speaker recognition; WCCN; cosine scoring; i-vector; spectral matching based VAD; total variability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems and Communications (ICCSC), 2014 First International Conference on
  • Conference_Location
    Trivandrum
  • Print_ISBN
    978-1-4799-6012-5
  • Type

    conf

  • DOI
    10.1109/COMPSC.2014.7032617
  • Filename
    7032617