• DocumentCode
    2636701
  • Title
    A Multilingual Hierarchy Mapping Method Based on GHSOM
  • Author
    Yang, Hsin-Chang ; Chen, Ding-Wen ; Lee, Chung-Hong
  • Author_Institution
    Nat. Univ. of Kaohsiung, Kaohsiung
  • fYear
    2008
  • fDate
    18-20 June 2008
  • Firstpage
    305
  • Lastpage
    305
  • Abstract
    With the increasing amount of multilingual text on the Internet, multilingual text retrieval has become an important research issue. However, discovering relationships between different languages remains an open problem. In this paper we propose a method that applies the growing hierarchical self-organizing map (GHSOM) model to discover knowledge from multilingual text documents. The GHSOM is trained on multilingual parallel corpora to generate hierarchical feature maps. A discovery process is then applied to these maps to reveal the relationships between documents of different languages. The relationships between keywords of different languages are also revealed. We conducted experiments on a set of Chinese-English bilingual parallel corpora to discover the relationships between documents in these languages.
  • Keywords
    Internet; information retrieval; natural languages; search engines; self-organising feature maps; text analysis; Chinese-English bilingual parallel corpora; GHSOM model; Internet; growing hierarchical self-organizing map model; multilingual hierarchy mapping method; multilingual text document; multilingual text retrieval technique; search engine; Clustering algorithms; Data visualization; Humans; Information retrieval; Internet; Natural languages; Search engines; Telecommunication control; Text analysis; Text mining;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
  • Conference_Location
    Dalian, Liaoning
  • Print_ISBN
    978-0-7695-3161-8
  • Electronic_ISBN
    978-0-7695-3161-8
  • Type
    conf
  • DOI
    10.1109/ICICIC.2008.48
  • Filename
    4603494