• DocumentCode
    337490
  • Title

    A new cohort normalization using local acoustic information for speaker verification

  • Author

    Isobe, Toshihiro ; Takahashi, Jun-ichi

  • Author_Institution
    Lab. for Inf. Technol., NTT Data Corp., Kanagawa, Japan
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    841
  • Abstract
    This paper describes a new cohort normalization method for HMM based speaker verification. In the proposed method, cohort models are synthesized based on the similarity of local acoustic features between speakers. The similarity can be determined using acoustic information lying in model components such as phonemes, states, and the Gaussian distributions of HMMs. With the method, the synthesized models can provide an effective normalizing score for various observed measurements because the difference between the individual reference model and the synthesized cohort models is statistically reduced through fine evaluation of acoustic similarity in model structure level. In the experiments using telephone speech of 100 speakers, it was found that high verification performance can be achieved by the proposed method: the equal error rate (EER) was drastically reduced from 1.20% (obtained by the conventional speaker-selection based cohort normalization) to 0.30% (obtained by the proposed method on distribution-based selection) in a closed test. Furthermore, EER was also reduced from 1.40% to 0.70% in open test (reference speaker: 25, impostor: 75), when the other speakers than the reference speaker were used as impostors
  • Keywords
    Gaussian distribution; acoustic signal processing; hidden Markov models; speaker recognition; Gaussian distributions; HMM based speaker verification; acoustic similarity; closed test; cohort normalization; distribution-based selection; equal error rate; experiments; high verification performance; impostors; local acoustic features similarity; local acoustic information; normalizing score; observed measurements; open test; phonemes; reference model; reference speaker; states; synthesized cohort models; telephone speech; Acoustic measurements; Error analysis; Gaussian distribution; Hidden Markov models; Information technology; Laboratories; Loudspeakers; Speech synthesis; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759802
  • Filename
    759802