DocumentCode :
2547971
Title :
Learning social networks using multiple resampling method
Author :
Makrehchi, Masoud ; Kamel, Mohamed S.
Author_Institution :
Univ. of Waterloo, Waterloo
fYear :
2007
fDate :
7-10 Oct. 2007
Firstpage :
406
Lastpage :
411
Abstract :
Automatic building of social networks requires extracting pair-wise relations of the individuals. In this paper, supervised learning of social networks from a set of documents is proposed. Given a small subset of relations between the individuals, the problem of learning social network is translated into a text classification problem. Relation between each pair of individuals is represented by a vector of words produced from merging all documents associated with these two individuals. The known relation is used as a label for the relation vector. The merged documents and their given labels, are used as training data. By this transformation, a text classifier such as SVM can be used for learning the unknown relations. We show that there is a link between the intrinsic sparsity of social networks and class distribution imbalance of the training data. In order to re-balance the unbalanced training data, a multiple resampling method, including undersampling of the majority and oversampling of the minority class, is employed. The proposed framework is applied to a friend of a friend (FOAF) data set and evaluated by the macro-averaged F-measure.
Keywords :
classification; learning (artificial intelligence); merging; sampling methods; social sciences computing; support vector machines; text analysis; SVM; document merging; multiple resampling method; social network; supervised learning; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
978-1-4244-0990-7
Electronic_ISBN :
978-1-4244-0991-4
Type :
conf
DOI :
10.1109/ICSMC.2007.4414079
Filename :
4414079
Link To Document :
بازگشت