Title :
NAP for high level language identification
Author :
Richardson, F.S. ; Campbell, W.M.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
Varying channel conditions present a difficult problem for many speech technologies such as language identification (LID). Channel compensation techniques have been shown to significantly improve performance in LID for acoustic systems. For high-level token systems, nuisance attribute projection (NAP) has been shown to per form well in the context of speaker identification. In this work, we describe a novel approach to dealing with the high dimensional sparse NAP training problem as applied to a 4-gram phonotactic LID system run on the NIST 2009 Language Recognition Evaluation (LRE) task. We demonstrate performance gains on the Voice of America (VOA) portion of the 2009 LRE data.
Keywords :
high level languages; speaker recognition; Voice of America; acoustic system; channel compensation technique; high level language identification; language recognition evaluation; nuisance attribute projection; phonotactic LID system; speaker identification; Hidden Markov models; Kernel; Lattices; NIST; Speech; Support vector machines; Training;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947327