Title :
iOLAP: A Framework for Analyzing the Internet, Social Networks, and Other Networked Data
Author :
Chi, Yun ; Zhu, Shenghuo ; Hino, Koji ; Gong, Yihong ; Zhang, Yi
Author_Institution :
NEC Labs. America, Cupertino, CA
fDate :
4/1/2009 12:00:00 AM
Abstract :
As the amount of noisy, unorganized, linked data on the Internet increases dramatically, how to efficiently analyze such data becomes a challenging research problem. In this paper, we propose a framework, iOLAP, that offers functionalities for analyzing networked data from Internet, social networks, scientific paper citations, etc. We first identify four main data dimensions that are common in most of networked data, namely people, relation, content, and time. Motivated by the fact that various dimensions of data jointly affect each other, we propose a polyadic factorization approach to directly model all the dimensions simultaneously in a unified framework. We provide detailed theoretical analysis of the new modeling framework. In addition to the theoretical framework, we also present an efficient implementation of the algorithm that takes advantage of the sparseness of data and has time complexity linear in the number of data records in a dataset. We then apply the proposed models to analyzing the blogosphere and personalizing recommendation in paper citations. Extensive experimental studies showed that our framework is able to provide deep insights jointed obtained from various dimensions of networked data.
Keywords :
Internet; citation analysis; data mining; information filtering; social networking (online); Internet; blogosphere analysis; iOLAP; polyadic factorization approach; recommendation personalization; scientific paper citation analysis; social network; time complexity; Information filtering; knowledge management applications; modeling structured; personalization; textual and multimedia data; web mining;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2009.2012912