DocumentCode :
3739238
Title :
Paradigmatic Clustering for NLP
Author :
Julio Santisteban; Tejada-C?rcamo
Author_Institution :
Univ. Catolica San Pablo, Arequipa, Peru
fYear :
2015
Firstpage :
814
Lastpage :
820
Abstract :
How can we retrieve meaningful information from a large and sparse graph?. Traditional approaches focus on generic clustering techniques and discovering dense cumulus in a network graph, however, they tend to omit interesting patterns such as the paradigmatic relations. In this paper, we propose a novel graph clustering technique modelling the relations of a node using the paradigmatic analysis. We exploit node´s relations to extract its existing sets of signifiers. The newly found clusters represent a different view of a graph, which provides interesting insights into the structure of a sparse network graph. Our proposed algorithm PaC (Paradigmatic Clustering) for clustering graphs uses paradigmatic analysis supported by a asymmetric similarity, in contrast to traditional graph clustering methods, our algorithm yields worthy results in tasks of word-sense disambiguation. In addition we propose a novel paradigmatic similarity measure. Extensive experiments and empirical analysis are used to evaluate our algorithm on synthetic and real data.
Keywords :
"Clustering algorithms","Algorithm design and analysis","Bipartite graph","Dolphins","Mutual information","Benchmark testing","Conferences"
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN :
2375-9259
Type :
conf
DOI :
10.1109/ICDMW.2015.233
Filename :
7395752
Link To Document :
بازگشت