• DocumentCode
    177685
  • Title

    Optimizing PLLR Features for Spoken Language Recognition

  • Author

    Diez, Mireia ; Varona, Amparo ; Penagarikano, Mike ; Rodriguez-Fuentes, Luis Javier ; Bordel, German

  • Author_Institution
    Dept. of Electr. & Electron., Univ. of the Basque Country UPV/EHU, Leioa, Spain
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    779
  • Lastpage
    784
  • Abstract
    Phone Log-Likelihood Ratios (PLLR) have been recently introduced as features for spoken language and speaker recognition systems. This representation has proven to be an effective way of retrieving acoustic-phonotactic information into frame-level vectors, which can be easily plugged into state-of-the-art systems. In a previous work, we began the search of reduced representations of PLLRs, as a mean of reducing computational costs. In this paper, we extend this search, by looking for the optimal compromise between feature vector size and system performance. Results achieved by Principal Component Analysis projection on the PLLR space are extensively analyzed. Also, to evaluate the effect of using larger temporal contexts, a Shifted Delta transformation is applied (and its optimal configuration explored) on highly reduced sets of PCA-projected PLLR features, leading to further performance improvements over the best PCA-projected PLLR set.
  • Keywords
    principal component analysis; speech recognition; PCA-projected PLLR features; PLLR feature optimization; acoustic-phonotactic information; feature vector size; frame-level vectors; phone log-likelihood ratios; principal component analysis; shifted delta transformation; speaker recognition systems; spoken language recognition; system performance; Acoustics; Computational modeling; Decoding; NIST; Principal component analysis; Speech; Vectors; Phone Log-Likelihood Ratios; Spoken Language Recognition; Total Variability Factor Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.144
  • Filename
    6976854