Title :
Von Mises–Fisher Models in the Total Variability Subspace for Language Recognition
Author :
Lopez-Moreno, Ignacio ; Ramos, Daniel ; Gonzalez-Dominguez, Javier ; Gonzalez-Rodriguez, Joaquin
Author_Institution :
ATVS Biometric Res. Lab., Univ. Autonoma de Madrid, Madrid, Spain
Abstract :
This letter proposes a new modeling approach for the Total Variability subspace within a Language Recognition task. Motivated by previous works in directional statistics, von Mises-Fisher distributions are used for assigning language-conditioned probabilities to language data, assumed to be spherically distributed in this subspace. The two proposed methods use Kernel Density Functions or Finite Mixture Models of such distributions. Experiments conducted on NIST LRE 2009 show that the proposed techniques significantly outperform the baseline cosine distance approach in most of the considered experimental conditions, including different speech conditions, durations and the presence of unseen languages.
Keywords :
natural language processing; probability; speech recognition; statistical analysis; NIST LRE 2009; Von Mises-fisher models; baseline cosine distance approach; directional statistics; finite mixture models; kernel density functions; language data; language recognition task; language-conditioned probability; speech conditions; total variability subspace; unseen languages; von Mises-Fisher distributions; Acoustics; Computational modeling; Data models; Density functional theory; Kernel; Mathematical model; Speaker recognition; Finite mixture models; Von Mises–Fisher; kernel density function; language recognition; total variability;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2011.2170566