DocumentCode :
3310696
Title :
Integrating feature ranking and clustering method to discover person relations in web news
Author :
Lihong Zhao ; Xiaojun Wan ; Yuqian Wu
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Volume :
3
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1821
Lastpage :
1825
Abstract :
Extracting the social relation network of persons is challenging. Discovering significant binary person relations embedded in the web news would be appropriate as the starting point. Prior methods for this task, however, chose to define the relation types first, focused on a few limited types, and always took over a large amount of web information. This paper describes an unsupervised person relation extraction system. This system automatically extracts important people relations from a limited batch of web news, and then proceeds to cluster the instances of these relations and finds discriminative words to represent different clusters. We use various feature ranking strategies for filtering instead of simple bag-of-words representation. We present the experiments evaluation results and give an overview of possible enhancements of this system.
Keywords :
Internet; information filtering; information resources; pattern clustering; social networking (online); unsupervised learning; Web information; Web news; bag-of-words representation; feature clustering method; feature ranking method; feature ranking strategies; information extraction; person relation discovery; social relation network extraction; unsupervised person relation extraction system; Data mining; Entropy; Feature extraction; Filtering; Natural language processing; Noise; Sun; Feature Ranking and Filtering; Unsupervised Person Relation Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
Type :
conf
DOI :
10.1109/FSKD.2011.6019861
Filename :
6019861
Link To Document :
بازگشت