DocumentCode :
2119495
Title :
Conceptualization Effects on MEDLINE Documents Classification Using Rocchio Method
Author :
Albitar, S. ; Fournier, Sebastien ; Espinasse, Bernard
Author_Institution :
LSIS, Aix Marseille Univ., Marseille, France
Volume :
1
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
462
Lastpage :
466
Abstract :
The aim of this paper is to propose a supervised text classification method for the biomedical domain using semantic resources. We choose the traditional text classification method, Rocchio, for its scalability and extendibility with semantic knowledge. This paper proposes to integrate semantic aspects into Rocchio through a conceptualization task. This conceptualization is realized by mapping terms that are extracted from text to their corresponding concepts in the UMLS® Metathesaurus® in order to take meaning into consideration during text classification. The proposed classifier is tested on the Ohsumed text corpus, which is composed of abstracts of biomedical articles retrieved from the MEDLINE® database. The effects of conceptualization on Rocchio's performance are discussed according to different standard similarity measures and to a variety of conceptualization strategies.
Keywords :
database management systems; medical computing; pattern classification; text analysis; thesauri; MEDLINE database; MEDLINE documents classification; Ohsumed text corpus; Rocchio method; UMLS metathesaurus; biomedical articles; biomedical domain; classifier; conceptualization effects; conceptualization strategy; conceptualization task; semantic aspects; semantic knowledge; semantic resources; similarity measures; supervised text classification method; term mapping; Classification; Information retrieval; Rocchio; Semantic classification; Similarity measures; conceptualization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.210
Filename :
6511925