• DocumentCode
    590609
  • Title

    On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition

  • Author

    Hirano, Ikuya ; Longbiao Wang ; Kai, Atsuhiko ; Nakagawa, Sachiko

  • Author_Institution
    Shizuoka Univ., Hamamatsu, Japan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Recent studies have shown that phase information contains speaker characteristics. A new extraction method to extract pitch synchronous phase information has been proposed and shown that it was very effective under channel matched condition. However, phase changes between different channels. Therefore, the speaker recognition performance is drastically degraded under channel mismatch condition. On the other hand, joint factor analysis (JFA) is an approach that is robust for channel variability. In this paper, we propose phase information-based JFA for speaker verification under channel mismatch condition. Speaker verification experiments were performed using the NIST 2003 SRE database. Phase information-based JFA achieved a relative equal error rate reduction of 20.9% for male and 17.4% for female compared to the traditional system based on Gaussian mixture model and Universal background model (GMM-UBM) that influenced by channel variability. Furthermore, by combining phase information-based method with the MFCC-based method, we obtained the better result than that of the only MFCC-based method.
  • Keywords
    Gaussian processes; speaker recognition; GMM-UBM; Gaussian mixture model; MFCC-based method; NIST 2003 SRE database; Universal background model; channel matched condition; channel mismatch condition; error rate reduction; phase information-based JFA; phase information-based joint factor analysis; phase information-based method; pitch synchronous phase information extraction; speaker characteristics; speaker verification; Data mining; Databases; Joints; Mel frequency cepstral coefficient; NIST; Speaker recognition; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type

    conf

  • Filename
    6411756