DocumentCode
698258
Title
Improvement of language Identification performance by Aggregated Phone Recognizer
Author
Hosseini Amereii, S.A. ; Homayounpour, M.M.
Author_Institution
Lab. for Intell. Sound & Speech Process., Amirkabir Univ. of Technol., Tehran, Iran
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
1770
Lastpage
1773
Abstract
Two popular and better performing approaches to language Identification (LID) are Phone Recognition followed by Language Modeling (PRLM) and Parallel PRLM. In this paper, a new LID approach named Aggregated PRLM or APRLM is proposed. In PRLM based LID systems, only one phone recognizer is used, independently of the language targets. At the opposite, in PPRLM based LID systems, multiple phone recognizers are used, but always independently of the language targets. So it may happen that all phones of a language target don´t occur in at least one of the tokenizers provided by the phone recognizers. In this paper, it is proposed that after the phone recognition step, to aggregate the phone sequences obtained by multiple phone recognizers and to provide a new phone sequence. Several language identification experiments were conducted and the proposed improvements were evaluated using OGI-MLTS corpus. Our results show that APRLM overcomes PPRLM about 1.3% in two language classification tasks.
Keywords
natural language processing; APRLM; OGI-MLTS corpus; PPRLM based LID systems; aggregated PRLM; aggregated phone recognizer; language classification tasks; language identification performance; language modeling; language targets; multiple phone recognizers; parallel PRLM; phone recognition step; phone sequences; tokenizers; Abstracts; Accuracy; Databases; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077833
Link To Document