• DocumentCode
    698258
  • Title

    Improvement of language Identification performance by Aggregated Phone Recognizer

  • Author

    Hosseini Amereii, S.A. ; Homayounpour, M.M.

  • Author_Institution
    Lab. for Intell. Sound & Speech Process., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1770
  • Lastpage
    1773
  • Abstract
    Two popular and better performing approaches to language Identification (LID) are Phone Recognition followed by Language Modeling (PRLM) and Parallel PRLM. In this paper, a new LID approach named Aggregated PRLM or APRLM is proposed. In PRLM based LID systems, only one phone recognizer is used, independently of the language targets. At the opposite, in PPRLM based LID systems, multiple phone recognizers are used, but always independently of the language targets. So it may happen that all phones of a language target don´t occur in at least one of the tokenizers provided by the phone recognizers. In this paper, it is proposed that after the phone recognition step, to aggregate the phone sequences obtained by multiple phone recognizers and to provide a new phone sequence. Several language identification experiments were conducted and the proposed improvements were evaluated using OGI-MLTS corpus. Our results show that APRLM overcomes PPRLM about 1.3% in two language classification tasks.
  • Keywords
    natural language processing; APRLM; OGI-MLTS corpus; PPRLM based LID systems; aggregated PRLM; aggregated phone recognizer; language classification tasks; language identification performance; language modeling; language targets; multiple phone recognizers; parallel PRLM; phone recognition step; phone sequences; tokenizers; Abstracts; Accuracy; Databases; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077833