• DocumentCode
    454532
  • Title

    Probabilistic Latent Prosody Analysis for Robust Speaker Verification

  • Author

    Chen, Zi-He ; Zeng, Zhi-Ren ; Liao, Yuan-Fu ; Juang, Yau-Tarng

  • Author_Institution
    Dept. of Electr. Eng., Nat. Central Univ., Chung-li
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In this investigation, two probabilistic latent semantic analyses (PLSA)-based approaches are proposed for use in speaker verification systems to reduce the number of parameters required by prosodic speaker models to (1) estimate reliably speakers´ bi-gram models and to (2) reduce the amount of required training and test data. The basic concept is to (1) adopt PLSA to smooth the underlying n-gram-based prosodic speaker models, and to (2) use PLSA to find a compact latent prosody space to represent efficiently the constellation of speakers. The proposed approaches are evaluated on the standard single-speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus, where only one 2 minute training enrollment speech and 30 s test speech on average are available. Experimental results demonstrated that the proposed approach can reduce the required number of bi-gram parameters from 112 to 88 and 63 per speaker and improve the EERs of MAP-GMM and GMM+T-norm from 12.4% and 9.5% to 10.4% and 8.4%, respectively, and finally to 8.1% after fusing all systems
  • Keywords
    probability; speaker recognition; bi-gram models; probabilistic latent prosody analysis; probabilistic latent semantic analyses; robust speaker verification; Data engineering; Electronic equipment testing; Hidden Markov models; NIST; Reliability engineering; Robustness; Speaker recognition; Speech analysis; System testing; Telephone sets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1659968
  • Filename
    1659968