DocumentCode :
3259078
Title :
Enhancing Text Retrieval Performance using Conceptual Ontological Graph
Author :
Shehata, Shady ; Karray, Fakhri ; Kamel, Mohamed
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont.
fYear :
2006
fDate :
Dec. 2006
Firstpage :
39
Lastpage :
44
Abstract :
Most of the data representation techniques are based on word and/or phrase analysis of the text. The statistical analysis of a term (word or phrase) frequency captures the importance of the term within a document. However, to achieve a more accurate analysis, the underlying data representation should indicate terms that capture the semantics of the text from which the importance of a term in a sentence and in the document can be derived. A new concept-based representation that relies on the analysis of the sentence semantics, rather than, the traditional analysis of the document dataset only is introduced. The proposed conceptual ontological graph representation denotes the terms which contribute to the sentence semantics. Then, each term is chosen based on its position in the proposed representation. Lastly, the selected terms are associated to their documents as features for the purpose of indexing in the text retrieval. Experiments using the proposed conceptual ontological graph representation in text retrieval are conducted. The evaluation of results is relied on two quality measures, the precision and the recall. Both of these quality measures improved when the newly developed representation is used to enhance the performance of the text retrieval
Keywords :
graph theory; information retrieval; ontologies (artificial intelligence); statistical analysis; text analysis; concept-based representation; conceptual ontological graph representation; data representation; document dataset; phrase analysis; sentence semantics; statistical analysis; text retrieval; word analysis; Data analysis; Data mining; Frequency; Indexing; Information retrieval; Natural language processing; Ontologies; Performance analysis; Statistical analysis; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
Type :
conf
DOI :
10.1109/ICDMW.2006.71
Filename :
4063595
Link To Document :
بازگشت