• DocumentCode
    2620919
  • Title

    CRAB: A CMS application for distributed analysis

  • Author

    Codispoti, G. ; Cinquilli, M. ; Fanfani, A. ; Fanzago, F. ; Farina, F. ; Kavka, C. ; Lacaprara, S. ; Miccio, V. ; Spiga, D. ; Vaandering, E.

  • Author_Institution
    INFN and UniversitÃ\xa0 di Bologna, I-40127, Italy
  • fYear
    2008
  • fDate
    19-25 Oct. 2008
  • Firstpage
    817
  • Lastpage
    822
  • Abstract
    Starting from 2008, the CMS experiment will produce several Pbytes of data every year, to be distributed over many computing centers geographically distributed in different countries. The CMS computing model defines how the data has to be distributed and accessed in order to enable physicists to run efficiently their analysis over the data. The analysis will be thus performed in a distributed way using Grid infrastructure. CRAB (CMS Remote Analysis Builder) is a specific tool, designed and developed by the CMS collaboration, that allows a transparent access to distributed data to end physicist. CRAB interacts with the local user environment, the CMS Data Management services and with the Grid middleware: it takes care of the data and resources discovery; it splits the user task in several analysis processes (jobs) and distribute and parallelize them over different Grid environments; it takes care of the process tracking and output handling. Very limited knowledge of underlying technical details are required to the end user. The tool can be used as a direct interface to the computing system or can delegate the task to a server, which takes care of the user jobs handling, providing services as automatic resubmission in case of failures and notification to the user of the task status. Its current implementation is able to interact with WLCG, gLite and OSG Grid middlewares. Furthermore it allows in the very same way the access to local data and batch systems such as LSF. CRAB has been in production and in routine use by end-users since Spring 2004. It has been extensively used in studies to prepare the Physics Technical Design Report, in the analysis of reconstructed event samples generated during the Computing Software and Analysis Challenges and in the preliminary cosmic rays data taking. The CRAB architecture and the usage inside the CMS community will be described in detail, as well as the current status and future development.
  • Keywords
    Collaborative tools; Collision mitigation; Computer interfaces; Data analysis; Distributed computing; Environmental management; Middleware; Performance analysis; Physics computing; Resource management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Nuclear Science Symposium Conference Record, 2008. NSS '08. IEEE
  • Conference_Location
    Dresden, Germany
  • ISSN
    1095-7863
  • Print_ISBN
    978-1-4244-2714-7
  • Electronic_ISBN
    1095-7863
  • Type

    conf

  • DOI
    10.1109/NSSMIC.2008.4774652
  • Filename
    4774652