• DocumentCode
    2016506
  • Title

    Harvesting Large-Scale Grids for Software Resources

  • Author

    Katsifodimos, Asterios ; Pallis, George ; Dikaiakos, Marios D.

  • Author_Institution
    Comput. Sci. Dept., Univ. of Cyprus, Nicosia
  • fYear
    2009
  • fDate
    18-21 May 2009
  • Firstpage
    252
  • Lastpage
    259
  • Abstract
    Grid infrastructures are in operation around the world, federating an impressive collection of computational resources and a wide variety of application software. In this context, it is important to establish advanced software discovery services that could help end-users locate software components suitable to their needs. In this paper, we present the design, architecture and implementation of an open-source keyword-based paradigm for the search of software resources in Grid infrastructures, called Minersoft. A key goal of Minersoft is to annotate automatically all the software resources with keyword-rich metadata. Using advanced Information Retrieval techniques, we locate software resources with respect to users queries. Experiments were conducted in EGEE, one of the largest Grid production services currently in operation. Results showed that Minersoft successfully crawled 12.3 million valid files (620 GB size) and sustained, in most sites, high crawling rates.
  • Keywords
    grid computing; object-oriented programming; public domain software; query processing; software architecture; information retrieval; keyword-rich metadata; large-scale Minersoft grid harvesting system; open-source keyword-based paradigm; software component; software discovery service; software resource search; Application software; Computer architecture; Context-aware services; Documentation; Grid computing; Information retrieval; Large-scale systems; Open source software; Production; Software maintenance; Crawling; Grid Computing; Indexing; Software Retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-3935-5
  • Electronic_ISBN
    978-0-7695-3622-4
  • Type

    conf

  • DOI
    10.1109/CCGRID.2009.51
  • Filename
    5071879