Title :
Optimizing PLLR Features for Spoken Language Recognition
Author :
Diez, Mireia ; Varona, Amparo ; Penagarikano, Mike ; Rodriguez-Fuentes, Luis Javier ; Bordel, German
Author_Institution :
Dept. of Electr. & Electron., Univ. of the Basque Country UPV/EHU, Leioa, Spain
Abstract :
Phone Log-Likelihood Ratios (PLLR) have been recently introduced as features for spoken language and speaker recognition systems. This representation has proven to be an effective way of retrieving acoustic-phonotactic information into frame-level vectors, which can be easily plugged into state-of-the-art systems. In a previous work, we began the search of reduced representations of PLLRs, as a mean of reducing computational costs. In this paper, we extend this search, by looking for the optimal compromise between feature vector size and system performance. Results achieved by Principal Component Analysis projection on the PLLR space are extensively analyzed. Also, to evaluate the effect of using larger temporal contexts, a Shifted Delta transformation is applied (and its optimal configuration explored) on highly reduced sets of PCA-projected PLLR features, leading to further performance improvements over the best PCA-projected PLLR set.
Keywords :
principal component analysis; speech recognition; PCA-projected PLLR features; PLLR feature optimization; acoustic-phonotactic information; feature vector size; frame-level vectors; phone log-likelihood ratios; principal component analysis; shifted delta transformation; speaker recognition systems; spoken language recognition; system performance; Acoustics; Computational modeling; Decoding; NIST; Principal component analysis; Speech; Vectors; Phone Log-Likelihood Ratios; Spoken Language Recognition; Total Variability Factor Analysis;
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
DOI :
10.1109/ICPR.2014.144