• DocumentCode
    606146
  • Title

    Improvisioning Hadoop in Globus Toolkit

  • Author

    Hema, S. ; Jaganathan, Suresh

  • Author_Institution
    Dept. of Comput. Sci. & Eng, Sri Sivasubramaniya Nadar Coll. of Eng., Chennai, India
  • fYear
    2013
  • fDate
    20-21 March 2013
  • Firstpage
    1082
  • Lastpage
    1088
  • Abstract
    This Grid is a computing centric distributed environment that integrates computing, storage and other resources to enable execution of applications that cannot run on a single resource. The vision of grid computing is that, every users can gain access to the computing resources inspite of its location or underlying technologies. Resource management and scheduling plays a critical role in achieving high utilization of resources in grid computing environments. The goal of scheduling is to achieve maximum throughput with available computing resources. Condor-G is a High-throughput scheduler supported by Grid Resource Allocation Management (GRAM) component of Globus. It uses non-dedicated resources to schedule the jobs. Processing huge amounts of data on large and scalable computational infrastructures is gaining increasing importance. Apache Hadoop, a data centric framework allows distributed processing of large sets of data across clusters of computers offering local computation, storage and failure handling thereby delivering a highly available service. In this paper, we have proposed a concept of Improvisioning Hadoop in Globus Toolkit, for efficient usage of resources, job management (scheduling) and faster execution. The performance of the proposed toolkit will improve in-terms of Job Execution, Resource Handling and Cluster Formation. electronic document is a “live” template and already defines the components of your paper [title, text, heads, etc.] in its style sheet.
  • Keywords
    grid computing; pattern clustering; resource allocation; scheduling; software fault tolerance; Apache Hadoop; Condor-G; GRAM; Globus toolkit; cluster formation; computing centric distributed environment; data centric framework; electronic document; failure handling; grid computing environments; grid resource allocation management; high-throughput scheduler; job execution; resource handling; File systems; Servers; Apache Hadoop; Condor-G; GRAM; Scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits, Power and Computing Technologies (ICCPCT), 2013 International Conference on
  • Conference_Location
    Nagercoil
  • Print_ISBN
    978-1-4673-4921-5
  • Type

    conf

  • DOI
    10.1109/ICCPCT.2013.6528902
  • Filename
    6528902