DocumentCode :
2173287
Title :
NAP for high level language identification
Author :
Richardson, F.S. ; Campbell, W.M.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4392
Lastpage :
4395
Abstract :
Varying channel conditions present a difficult problem for many speech technologies such as language identification (LID). Channel compensation techniques have been shown to significantly improve performance in LID for acoustic systems. For high-level token systems, nuisance attribute projection (NAP) has been shown to per form well in the context of speaker identification. In this work, we describe a novel approach to dealing with the high dimensional sparse NAP training problem as applied to a 4-gram phonotactic LID system run on the NIST 2009 Language Recognition Evaluation (LRE) task. We demonstrate performance gains on the Voice of America (VOA) portion of the 2009 LRE data.
Keywords :
high level languages; speaker recognition; Voice of America; acoustic system; channel compensation technique; high level language identification; language recognition evaluation; nuisance attribute projection; phonotactic LID system; speaker identification; Hidden Markov models; Kernel; Lattices; NIST; Speech; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947327
Filename :
5947327
Link To Document :
بازگشت