DocumentCode
2361489
Title
Automatic Construction of Multilingual Web Directory Using Self-Organizing Maps
Author
Yang, Hsin-Chang ; Hsiao, Han-Wei ; Lee, Chung-Hong
Author_Institution
Dept. of Inf. Manage., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
fYear
2009
fDate
25-27 Aug. 2009
Firstpage
1283
Lastpage
1288
Abstract
Web directories cluster Web pages into categories and usually organize them into hierarchies. Many users rely on them to browse for interesting Web pages in a coarse-to-fine manner. At present, most Web directories access monolingual Web pages and provide only a monolingual interface, which may limit the coverage and accessibility of Web pages for users familiar only with their native languages. Bilingual or multilingual Web directories may thus relieve such limitations. In this work, we develop an automated process to create multilingual (specifically, bilingual) Web directories from a set of parallel corpora. We adopted the self-organizing map model to cluster the Web pages and construct a Web directory for each language. A hierarchy alignment process was then applied to these monolingual hierarchies to obtain the relationships between the different languages. A multilingual Web directory was then created using these relationships. We conducted experiments on a set of parallel corpora, and the results suggest that our method is feasible.
Keywords
Web design; data mining; natural language processing; pattern clustering; self-organising feature maps; Web pages clustering; bilingual Web directory; hierarchy alignment process; monolingual interface; multilingual Web directory automatic construction; parallel corpora set; self-organizing map; Humans; Information management; Machine learning algorithms; Natural languages; Navigation; Portals; Self organizing feature maps; Text categorization; Text mining; Web pages; Hierarchy Alignment; Multilingual Web Directory; Self-Organizing Map; Web Directory Construction;
fLanguage
English
Publisher
IEEE
Conference_Titel
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4244-5209-5
Electronic_ISBN
978-0-7695-3769-6
Type
conf
DOI
10.1109/NCM.2009.78
Filename
5331506