Title :
WordNet based Cross-Language Text Categorization
Author :
Amine, Bentaallah Mohamed ; Mimoun, Malki
Author_Institution :
EEDIS Lab. Sidi Bel Abbes, Oran
Abstract :
This article is essentially dedicated to the problem of cross-language text categorization, that consists in classifying documents in different languages according to the same classification tree. The proposed approach is based on the idea to spread the utilization of WordNet in text categorization towards cross-language text categorization. Experimental results of the bi-lingual classification of the ILO corpus (with the documents in English and Spanish) show that the idea we describe are promising and deserve further investigation.
Keywords :
natural language processing; pattern classification; text analysis; trees (mathematics); English; ILO corpus; Spanish; WordNet; bilingual classification; classification tree; cross-language text categorization; document classification; Classification tree analysis; Globalization; Gold; IP networks; Laboratories; Natural languages; Shape; Silver; Statistical distributions; Text categorization;
Conference_Titel :
Computer Systems and Applications, 2007. AICCSA '07. IEEE/ACS International Conference on
Conference_Location :
Amman
Print_ISBN :
1-4244-1030-4
Electronic_ISBN :
1-4244-1031-2
DOI :
10.1109/AICCSA.2007.370731