Title :
Conceptual Indexing of Documents Using Wikipedia
Author :
Chahine, Carlo Abi ; Chaignaud, Nathalie ; Kotowicz, Jean-Philippe ; Pécuchet, Jean-Pierre
Author_Institution :
LITIS, INSA, Rouen, France
Abstract :
This paper presents an indexing support system that suggests to librarians a set of topics and keywords relevant to a pedagogical document. Our document indexing method uses the Wikipedia category network as a conceptual taxonomy. A directed acyclic graph is built for each document by mapping each term (one or more words) to a concept in the Wikipedia category network. Properties of the graph are used to weight these concepts. This allows the system to extract so-called important concepts from the graph and to disambiguate the terms of the document. Topics and keywords are then proposed based on these concepts. The method has been evaluated by librarians on a corpus of French pedagogical documents.
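The pipeline outlined in the abstract (map terms to categories, build a per-document directed acyclic graph, weight concepts, disambiguate terms) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy category data, the names PARENTS and CANDIDATES, and in particular the weighting rule (number of distinct document terms that reach a concept), since the abstract does not state which graph properties the authors use.

```python
from collections import defaultdict

# Hypothetical toy slice of a category network: category -> parent categories.
PARENTS = {
    "Snakes": ["Reptiles"],
    "Reptiles": ["Animals"],
    "Programming languages": ["Computing"],
    "Operating systems": ["Computing"],
    "Animals": [],
    "Computing": [],
}

# Hypothetical term -> candidate categories (ambiguous terms have several).
CANDIDATES = {
    "python": ["Snakes", "Programming languages"],
    "linux": ["Operating systems"],
    "compiler": ["Programming languages"],
}


def ancestors(category):
    """Return the category plus every category reachable through parent links."""
    seen, stack = set(), [category]
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(PARENTS.get(node, []))
    return seen


def build_document_graph(terms):
    """Map each concept in the document's sub-graph to the terms that reach it."""
    support = defaultdict(set)
    for term in terms:
        for candidate in CANDIDATES.get(term, []):
            for concept in ancestors(candidate):
                support[concept].add(term)
    return support


def weight(support):
    """Assumed weighting: count of distinct terms that reach each concept."""
    return {concept: len(terms) for concept, terms in support.items()}


def disambiguate(term, weights):
    """Pick the candidate whose ancestor set carries the largest total weight."""
    return max(CANDIDATES[term], key=lambda c: sum(weights[a] for a in ancestors(c)))


terms = ["python", "linux", "compiler"]
weights = weight(build_document_graph(terms))
top_concept = max(weights, key=weights.get)
chosen = disambiguate("python", weights)
```

In this toy run the ambiguous term "python" resolves to the computing sense because the other terms of the document reinforce the same branch of the graph. A real system would replace the dictionaries with the actual Wikipedia category network and a weighting scheme taken from the paper itself.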
Keywords :
directed graphs; document handling; indexing; Wikipedia category network; concept extraction; conceptual taxonomy; directed acyclic graph; document conceptual indexing; document term disambiguation; indexing support system; pedagogical document; Electronic publishing; Encyclopedias; Indexing; Internet; Linux; Semantics; Directed Acyclic Graph; Document Indexing; Keyword and Topic Extraction; Wikipedia
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
DOI :
10.1109/WI-IAT.2011.104