Title : 
PageRank-based Word Sense Induction within Web Search Results Clustering
         
        
            Author : 
Moreno, Jose G. ; Dias, Guilherme
         
        
            Author_Institution : 
GREYC, Normandie Univ., Caen, France
         
        
        
        
        
        
            Abstract : 
Word Sense Induction is an open problem in Natural Language Processing. Many recent works have been addressing this problem with a wide spectrum of strategies based on content analysis. In this paper, we present a sense induction strategy exclusively based on link analysis over the Web. In particular, we explore the idea that the main different senses of a given word share similar linking properties and can be found by performing clustering with link-based similarity metrics. The evaluation results show that PageRank-based sense induction achieves interesting results when compared to state-of-the-art content-based algorithms in the context of Web Search Results Clustering.
         
        
            Keywords : 
Internet; content management; natural language processing; pattern clustering; search engines; PageRank-based word sense induction; Web search results clustering; content analysis; link analysis; link-based similarity metrics; natural language processing; Algorithm design and analysis; Clustering algorithms; Joining processes; Kernel; Measurement; Web pages; Web search; PageRank Clustering; Web Links; Word Sense Induction;
         
        
        
        
            Conference_Titel : 
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
         
        
            Conference_Location : 
London
         
        
        
            DOI : 
10.1109/JCDL.2014.6970227