DocumentCode :
698258
Title :
Improvement of language Identification performance by Aggregated Phone Recognizer
Author :
Hosseini Amereii, S.A. ; Homayounpour, M.M.
Author_Institution :
Lab. for Intell. Sound & Speech Process., Amirkabir Univ. of Technol., Tehran, Iran
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1770
Lastpage :
1773
Abstract :
Two popular and better performing approaches to language Identification (LID) are Phone Recognition followed by Language Modeling (PRLM) and Parallel PRLM. In this paper, a new LID approach named Aggregated PRLM or APRLM is proposed. In PRLM based LID systems, only one phone recognizer is used, independently of the language targets. At the opposite, in PPRLM based LID systems, multiple phone recognizers are used, but always independently of the language targets. So it may happen that all phones of a language target don´t occur in at least one of the tokenizers provided by the phone recognizers. In this paper, it is proposed that after the phone recognition step, to aggregate the phone sequences obtained by multiple phone recognizers and to provide a new phone sequence. Several language identification experiments were conducted and the proposed improvements were evaluated using OGI-MLTS corpus. Our results show that APRLM overcomes PPRLM about 1.3% in two language classification tasks.
Keywords :
natural language processing; APRLM; OGI-MLTS corpus; PPRLM based LID systems; aggregated PRLM; aggregated phone recognizer; language classification tasks; language identification performance; language modeling; language targets; multiple phone recognizers; parallel PRLM; phone recognition step; phone sequences; tokenizers; Abstracts; Accuracy; Databases; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077833
Link To Document :
بازگشت