DocumentCode :
2501333
Title :
Constructing Multilingual Preterminological Graphs using various online-community resources
Author :
Daoud, Mohammad ; Boitet, Christian ; Kageura, Kyo ; Kitamoto, Asanobu ; Daoud, Daoud ; Mangeot, Mathieu
Author_Institution :
Grenoble Inf. Lab., Univ. Joseph Fourier, Grenoble, France
fYear :
2009
fDate :
20-22 Oct. 2009
Firstpage :
116
Lastpage :
121
Abstract :
We describe the concept of dedicated multilingual preterminological graphs (MPGs) and some automatic approaches for constructing them by analyzing the behavior of online community users. A multilingual preterminological graph is a special lexical resource that contains a massive number of terms related to a particular domain and can be used as raw material for later building a standardized terminological repository. Building such a graph is difficult with traditional approaches, because it requires a huge effort from domain specialists and terminologists. In our approach, we build the graph by analyzing the access log files of the community's Web site, finding the important terms that have been used to search that Web site and the associations between them. We aim to make this graph a seed repository to which multilingual volunteers can contribute. We are experimenting with this approach on the Digital Silk Road (DSR) Project. We have used its access log files since its beginning in 2003 and obtained an initial graph of around 116,000 terms. As an application, we used this graph to obtain a preterminological multilingual database that serves a cross-language information retrieval (CLIR) system for the DSR project.
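The log-analysis idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the log layout (pairs of session ID and requested URL), the query parameter name `q`, and the rule that two terms are associated when they are searched in the same session are all assumptions made for illustration. The keywords describe a directed graph, whereas this sketch only counts undirected co-occurrences.

```python
from collections import Counter
from itertools import combinations
from urllib.parse import urlparse, parse_qs


def build_term_graph(log_entries, query_param="q"):
    """Build a simple term co-occurrence graph from search log entries.

    log_entries: iterable of (session_id, requested_url) pairs.
    This layout and the parameter name are hypothetical, not taken
    from the DSR access logs.
    """
    # Collect the distinct search terms used in each session.
    terms_by_session = {}
    for session_id, url in log_entries:
        values = parse_qs(urlparse(url).query).get(query_param, [])
        for value in values:
            term = value.strip().lower()
            if term:
                terms_by_session.setdefault(session_id, set()).add(term)

    node_counts = Counter()   # term -> number of sessions that used it
    edge_weights = Counter()  # (term_a, term_b) -> number of shared sessions
    for terms in terms_by_session.values():
        node_counts.update(terms)
        # Assumed association rule: terms searched in the same session.
        for a, b in combinations(sorted(terms), 2):
            edge_weights[(a, b)] += 1
    return node_counts, edge_weights


# Made-up sample data, for illustration only.
sample_log = [
    ("s1", "/search?q=silk+road"),
    ("s1", "/search?q=route+de+la+soie"),
    ("s2", "/search?q=Silk+Road"),
    ("s2", "/search?q=dunhuang"),
]
nodes, edges = build_term_graph(sample_log)
```

In this sketch, a term's node count suggests how important it is to the community, and an edge between terms in different languages (here "silk road" and "route de la soie") is only a candidate association that volunteers would still need to confirm.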
Keywords :
Web sites; directed graphs; information retrieval; information retrieval systems; natural language processing; CLIR system; DSR project; Digital Silk Road Project; MPG; access log file; automatic construction approach; community Web site; cross-language information retrieval; dedicated multilingual preterminological graph construction; directed graph; lexical resource; natural language processing; online community user behavior analysis; online-community resources; preterminological multilingual database; standardized terminological repository; term search; Artificial neural networks; Biological neural networks; Decision making; Machine learning; Natural language processing; Neural networks; Radial basis function networks; Support vector machine classification; Support vector machines; Weather forecasting;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
Type :
conf
DOI :
10.1109/SNLP.2009.5340936
Filename :
5340936