DocumentCode :
1867030
Title :
Effective Keyword Search for Software Resources Installed in Large-Scale Grid Infrastructures
Author :
Pallis, George ; Katsifodimos, Asterios ; Dikaiakos, Marios D.
Volume :
1
fYear :
2009
fDate :
15-18 Sept. 2009
Firstpage :
482
Lastpage :
489
Abstract :
In this paper, we investigate the problem of supporting keyword-based searching for the discovery of software resources that are installed on the nodes of large-scale, federated Grid computing infrastructures. We address a number of challenges that arise from the unstructured nature of software and the unavailability of software-related metadata on Grid sites. We present Minersoft, a Grid harvester that visits Grid sites, crawls their file-systems, identifies and classifies software resources, and discovers implicit associations between them. The results of Minersoft harvesting are encoded in a weighted, typed graph, named the Software Graph. A number of IR algorithms are used to enrich this graph with structural and content associations, to annotate software resources with keywords, and build inverted indexes to support keyword-based searching for software. Using a real testbed, we present an evaluation study of our approach, using data extracted from a production-quality Grid infrastructure. Experimental results show that our approach achieves high search efficiency.
Keywords :
Clouds; Data mining; File systems; Grid computing; Intelligent agent; Keyword search; Large-scale systems; Search engines; Software maintenance; Software tools; Grid computing; Knowledge Grids; Resource Management; Software retrieval;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Milan, Italy
Print_ISBN :
978-0-7695-3801-3
Electronic_ISBN :
978-1-4244-5331-3
Type :
conf
DOI :
10.1109/WI-IAT.2009.82
Filename :
5286027
Link To Document :
بازگشت