DocumentCode :
597753
Title :
Biomedical document clustering using ontology based concept weight
Author :
Logeswari, S. ; Premalatha, K.
Author_Institution :
CSE Dept., Bannari Amman Inst. of Technol., Sathyamangalam, India
fYear :
2013
fDate :
4-6 Jan. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Conventional document clustering techniques are mainly based on the existence of keywords and the number of occurrences of it. Most of the term frequency based clustering techniques consider the documents as bag-of-words and ignore the important relationships between the words in the document. Phrase based clustering techniques also capture only the order in which the words occur in a sentence rather than the semantics behind the words. Hence a concept based clustering technique is proposed in this paper. It uses Medical Subject Headings MeSH ontology for concept extraction and concept weight calculation based on the identity and synonymy relationships. K-means algorithm is used for clustering the documents based on the semantic similarity and the results are analyzed.
Keywords :
data mining; medical computing; ontologies (artificial intelligence); pattern clustering; text analysis; K-means algorithm; MeSH ontology; bag-of-words; biomedical document clustering; concept based clustering technique; concept extraction; concept weight calculation; document clustering technique; identity relationship; keywords; medical subject headings; ontology based concept weight; phrase based clustering technique; semantic similarity; synonymy relationship; term frequency based clustering technique; word relationship; word semantics; Abstracts; Computers; Data mining; Indexing; Ontologies; Semantics; Vectors; Document Clustering; Ontology; concept weight; semantic similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2013 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4673-2906-4
Type :
conf
DOI :
10.1109/ICCCI.2013.6466273
Filename :
6466273
Link To Document :
بازگشت