• DocumentCode
    2222688
  • Title

    Challenges in large scale distributed computing: bioinformatics

  • Author

    Disz, Terry ; Kubal, Mike ; Olson, Robert ; Overbeek, Ross ; Stevens, Rick

  • Author_Institution
    Argonne Nat. Lab., Chicago Univ., IL, USA
  • fYear
    2005
  • fDate
    38557
  • Firstpage
    57
  • Lastpage
    65
  • Abstract
    The amount of genomic data available for study is increasing at a rate similar to that of Moore's law. This deluge of data is challenging bioinformaticians to develop newer, faster, and better algorithms for analyzing and examining it. The growing availability of large-scale computing grids, coupled with high-performance networking, is challenging computer scientists to develop better, faster methods of exploiting parallelism in these biological computations and deploying them across computing grids. In this paper, we describe two computations that must be run frequently and that require large amounts of computing resources to complete in a reasonable time. The data for these computations are very large, and the sequential computational time can exceed thousands of hours. We show the importance and relevance of these computations and the nature of the data and parallelism, and we show how we are meeting the challenge of efficiently distributing and managing these computations in the SEED project.
  • Keywords
    biology computing; genetics; grid computing; Moore law; SEED project; bioinformatics; biological computation; computing grid; genomic data; high-performance networking; large scale distributed computing; sequential computational time; Bioinformatics; Biology computing; Computer networks; Concurrent computing; Distributed computing; Genomics; Grid computing; Large-scale systems; Moore's Law; Parallel processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Challenges of Large Applications in Distributed Environments, 2005. CLADE 2005. Proceedings
  • Print_ISBN
    0-7803-9043-1
  • Type

    conf

  • DOI
    10.1109/CLADE.2005.1520902
  • Filename
    1520902