• DocumentCode
    484519
  • Title

    Mining Scientific Data using the Internet as the Computer

  • Author

    Graves, Sara ; Ramachandran, Rahul ; Lynnes, Christopher ; Maskey, Manil ; Keiser, Ken ; Pham, Long

  • Author_Institution
    Univ. of Alabama in Huntsville, Huntsville, AL
  • Volume
    4
  • fYear
    2008
  • fDate
    7-11 July 2008
  • Abstract
    This paper describes approaches and methodologies facilitating the analysis of large amounts of distributed scientific data. The existence of full-featured analysis tools, such as the Algorithm Development and Mining (ADaM) toolkit and online data repositories now provide easy access and analysis capabilities to large amounts of data. However, there are obstacles to getting the analysis tools and the data together in a workable environment. Does one bring the data to the tools or deploy the tools close to the data? The large size of many current Earth science datasets incurs significant overhead in network transfer for analysis workflows, even with the current advanced networking capabilities. We are developing two solutions for this problem that address different analysis scenarios. The first is a Data Center Deployment of the analysis services for large data selections, orchestrated by a remotely defined analysis workflow. The second is a Data Mining Center approach of providing a cohesive analysis solution for smaller subsets of data. The two approaches can be complementary and thus provide flexibility for researchers to exploit the best solution for their data requirements.
  • Keywords
    Web services; data mining; geophysical techniques; geophysics computing; ADaM toolkit; Algorithm Development and Mining; Data Center Deployment; Data Mining Center approach; Internet; cohesive analysis; distributed scientific data; full-featured analysis tools; network transfer; online data; remotely defined analysis workflow; Algorithm design and analysis; Computer architecture; Data analysis; Data mining; Data processing; Geoscience; Iterative algorithms; Service oriented architecture; Web and internet services; Web services; Data Mining; Science Analysis; Web Services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Geoscience and Remote Sensing Symposium, 2008. IGARSS 2008. IEEE International
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-2807-6
  • Electronic_ISBN
    978-1-4244-2808-3
  • Type

    conf

  • DOI
    10.1109/IGARSS.2008.4779713
  • Filename
    4779713