DocumentCode :
2799501
Title :
WordNet based Cross-Language Text Categorization
Author :
Amine, Bentaallah Mohamed ; Mimoun, Malki
Author_Institution :
EEDIS Lab. Sidi Bel Abbes, Oran
fYear :
2007
fDate :
13-16 May 2007
Firstpage :
848
Lastpage :
855
Abstract :
This article is essentially dedicated to the problem of cross-language text categorization, that consists in classifying documents in different languages according to the same classification tree. The proposed approach is based on the idea to spread the utilization of WordNet in text categorization towards cross-language text categorization. Experimental results of the bi-lingual classification of the ILO corpus (with the documents in English and Spanish) show that the idea we describe are promising and deserve further investigation.
Keywords :
natural language processing; pattern classification; text analysis; trees (mathematics); English; ILO corpus; Spanish; WordNet; bilingual classification; classification tree; cross-language text categorization; document classification; Classification tree analysis; Globalization; Gold; IP networks; Laboratories; Natural languages; Shape; Silver; Statistical distributions; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications, 2007. AICCSA '07. IEEE/ACS International Conference on
Conference_Location :
Amman
Print_ISBN :
1-4244-1030-4
Electronic_ISBN :
1-4244-1031-2
Type :
conf
DOI :
10.1109/AICCSA.2007.370731
Filename :
4231059
Link To Document :
بازگشت