• DocumentCode
    38159
  • Title

    On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition

  • Author

    Diez, Mireia ; Varona, Amparo ; Penagarikano, Mike ; Rodriguez-Fuentes, Luis Javier ; Bordel, German

  • Author_Institution
    Dept. of Electr. & Electron., Univ. of the Basque Country, Leioa, Spain
  • Volume
    21
  • Issue
    6
  • fYear
    2014
  • fDate
    Jun-14
  • Firstpage
    649
  • Lastpage
    652
  • Abstract
    In this letter, we apply Phone Log-Likelihood Ratio (PLLR) features to the task of speaker recognition. PLLRs, which are computed on the phone posterior probabilities provided by phone decoders, convey acoustic-phonetic information in a sequence of frame-level vectors, and therefore can be easily plugged into traditional acoustic systems, just by replacing the Mel-Frequency Cepstral Coefficients (MFCC) or an alternate representation. To study the performance of the proposed features, MFCC-based and PLLR-based systems are trained under an i-vector-PLDA approach. Results on the NIST 2010 and 2012 Speaker Recognition Evaluation databases show that, despite yielding lower performance than the acoustic system, the system based on PLLR features does provide significant gains when both systems are fused, which reveals a complementarity among features, and provides a suitable and effective way of using higher level phonetic information in speaker recognition systems.
  • Keywords
    audio databases; probability; speaker recognition; MFCC; Mel frequency cepstral coefficients; PLLR features; acoustic systems; convey acoustic phonetic information; frame level vectors; improved speaker recognition; phone decoders; phone log likelihood ratio; phone posterior probabilities; phonetic information; speaker recognition evaluation databases; Decoding; Mel frequency cepstral coefficient; NIST; Speaker recognition; Speech; Training; Vectors; PLLR; i-vectors; probabilistic linear discriminant analysis; speaker recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2014.2312213
  • Filename
    6774456