DocumentCode
2016506
Title
Harvesting Large-Scale Grids for Software Resources
Author
Katsifodimos, Asterios ; Pallis, George ; Dikaiakos, Marios D.
Author_Institution
Comput. Sci. Dept., Univ. of Cyprus, Nicosia
fYear
2009
fDate
18-21 May 2009
Firstpage
252
Lastpage
259
Abstract
Grid infrastructures are in operation around the world, federating an impressive collection of computational resources and a wide variety of application software. In this context, it is important to establish advanced software discovery services that could help end-users locate software components suitable to their needs. In this paper, we present the design, architecture and implementation of an open-source keyword-based paradigm for the search of software resources in Grid infrastructures, called Minersoft. A key goal of Minersoft is to annotate automatically all the software resources with keyword-rich metadata. Using advanced Information Retrieval techniques, we locate software resources with respect to users queries. Experiments were conducted in EGEE, one of the largest Grid production services currently in operation. Results showed that Minersoft successfully crawled 12.3 million valid files (620 GB size) and sustained, in most sites, high crawling rates.
Keywords
grid computing; object-oriented programming; public domain software; query processing; software architecture; information retrieval; keyword-rich metadata; large-scale Minersoft grid harvesting system; open-source keyword-based paradigm; software component; software discovery service; software resource search; Application software; Computer architecture; Context-aware services; Documentation; Grid computing; Information retrieval; Large-scale systems; Open source software; Production; Software maintenance; Crawling; Grid Computing; Indexing; Software Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-3935-5
Electronic_ISBN
978-0-7695-3622-4
Type
conf
DOI
10.1109/CCGRID.2009.51
Filename
5071879
Link To Document