Title :
Summary Creation for Information Discovery in Distributed Systems
Author :
Caminero, Agustín C. ; Huedo, Eduardo ; Rana, Omer ; Llorente, Ignacio M. ; Caminero, Blanca ; Carrión, Carmen
Author_Institution :
Nat. Univ. of Distance Educ., Spain
Abstract :
In current distributed systems, such as Grids, Clouds, or P2P systems, the amount of information to handle influences the way the system is managed. In P2P systems containing large quantities of data, or in Grid systems containing a large number of (often heterogeneous) resources, information about data or resources must be spread through the system in an efficient way in order to allow them to be found. An information discovery technique based on data summarization, via clustering, is presented. These summaries can be used to classify information to provide users with greater insight about documents or computing resources compared to raw data. Also, meta-schedulers or brokers would benefit from the proposed technique due to the fact that they would have to deal with less data from resources, thus aiding to the scalability of the system. An evaluation of the approach is subsequently provided to identify the impact of choosing particular parameters to be used as part of the summary.
Keywords :
data mining; grid computing; pattern clustering; peer-to-peer computing; P2P system; computing resource; data clustering; data summarization; distributed system; grid system; information classification; information discovery; meta scheduler; Clustering algorithms; Computational modeling; Databases; Electronic mail; Peer to peer computing; Proposals; Scalability; classification; distributed systems; information discovery; summary creation;
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
Conference_Location :
Ayia Napa
Print_ISBN :
978-1-4244-9682-2
DOI :
10.1109/PDP.2011.18