Title :
Generating a concept hierarchy for sentiment analysis
Author :
Shi, Bin ; Chang, Kuiyu
Author_Institution :
S.Rajaratnam Sch. of Int. Studies, Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we propose an unsupervised machine learning method to automatically construct a product hierarchical concept model based on the online reviews of this product. Our method starts by representing each candidate noun using a feature context vector, which is simply a vector of all its co-occurring neighbors excluding itself. We then applied bisection clustering to hierarchically cluster the context vectors to obtain a cluster hierarchy. Lastly, we proposed and evaluated two methods to label each intermediate clustering node with the most representative member context feature vector. Experiments conducted on 3 sets of on-line reviews (in both Chinese and English) benchmarked qualitatively and quantitatively against a well known existing approach demonstrated the effectiveness and robustness of our approach.
Keywords :
natural language processing; pattern clustering; text analysis; unsupervised learning; vectors; bisection clustering; concept hierarchy; feature context vector; product hierarchical concept model; sentiment analysis; unsupervised machine learning method; Binary trees; Frequency; Labeling; Learning systems; Probability; Robustness; Search engines; Tree data structures;
Conference_Titel :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2008.4811294