مرکز منطقه ای اطلاع رساني علوم و فناوري - NAP for high level language identification

DocumentCode :

2173287

Title :

NAP for high level language identification

Author :

Richardson, F.S. ; Campbell, W.M.

Author_Institution :

Lincoln Lab., MIT, Lexington, MA, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4392

Lastpage :

4395

Abstract :

Varying channel conditions present a difficult problem for many speech technologies such as language identification (LID). Channel compensation techniques have been shown to significantly improve performance in LID for acoustic systems. For high-level token systems, nuisance attribute projection (NAP) has been shown to per form well in the context of speaker identification. In this work, we describe a novel approach to dealing with the high dimensional sparse NAP training problem as applied to a 4-gram phonotactic LID system run on the NIST 2009 Language Recognition Evaluation (LRE) task. We demonstrate performance gains on the Voice of America (VOA) portion of the 2009 LRE data.

Keywords :

high level languages; speaker recognition; Voice of America; acoustic system; channel compensation technique; high level language identification; language recognition evaluation; nuisance attribute projection; phonotactic LID system; speaker identification; Hidden Markov models; Kernel; Lattices; NIST; Speech; Support vector machines; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947327

Filename :

5947327

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2173287