DocumentCode :
3165541
Title :
Finding Cohesive Clusters for Analyzing Knowledge Communities
Author :
Kandylas, Vasileios ; Upham, S. Phineas ; Ungar, Lyle H.
Author_Institution :
Univ. of Pennsylvania, Philadelphia
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
203
Lastpage :
212
Abstract :
Documents and authors can be clustered into "knowledge communities" based on the overlap in the papers they cite. We introduce a new clustering algorithm, Streemer, which finds cohesive foreground clusters embedded in a diffuse background, and use it to identify knowledge communities as foreground clusters of papers which share common citations. To analyze the evolution of these communities over time, we build predictive models with features based on the citation structure, the vocabulary of the papers, and the affiliations and prestige of the authors. Findings include that scientific knowledge communities tend to grow more rapidly if their publications build on diverse information and if they use a narrow vocabulary.
Keywords :
data mining; information networks; text analysis; Streemer; clustering algorithm; cohesive foreground clusters; predictive models; scientific knowledge communities; text mining; Citation analysis; Clustering algorithms; Communities; Computational Intelligence Society; Data mining; Predictive models; Rhetoric; Text mining; Time measurement; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3018-5
Type :
conf
DOI :
10.1109/ICDM.2007.22
Filename :
4470244
Link To Document :
بازگشت