DocumentCode :
2335691
Title :
Text clustering based on good aggregations
Author :
Hotho, Andreas ; Maedche, Alexander ; Staab, Steffen
Author_Institution :
Inst. fur Angewandte Inf. und Formale Beschreibungsverfahren, Karlsruhe Univ., Germany
fYear :
2001
fDate :
2001
Firstpage :
607
Lastpage :
608
Abstract :
Text clustering typically involves clustering in a high dimensional space, which appears difficult with regard to virtually all practical settings. In addition, given a particular clustering result it is typically very hard to come up with a good explanation of why the text clusters have been constructed the way they are. We propose a new approach for applying background knowledge (in terms of an ontology) during preprocessing in order to improve clustering results and allow for selection between results. The results may be distinguished and explained by the corresponding selection of concepts in the ontology. Our results compare favourably with a sophisticated baseline preprocessing strategy
Keywords :
data mining; data warehouses; pattern clustering; text analysis; background knowledge; good aggregations; high dimensional space; ontology; preprocessing; text clustering; Clustering algorithms; Clustering methods; Heuristic algorithms; Humans; Knowledge management; Measurement standards; Navigation; Ontologies; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
Type :
conf
DOI :
10.1109/ICDM.2001.989577
Filename :
989577
Link To Document :
بازگشت