• DocumentCode
    3504237
  • Title

    Snapshot Processing in Streaming Environments

  • Author

    Zimmerman, Daniel M. ; Chandy, K. Mani

  • Author_Institution
    Dept. of Comput. Sci., California Inst. of Technol., Pasadena, CA
  • fYear
    2006
  • fDate
    28-29 Sept. 2006
  • Firstpage
    319
  • Lastpage
    320
  • Abstract
    Computational issues related to streaming data, and in particular the monitoring and rapid correlation of multiple sources of streaming data, are becoming increasingly important in contexts ranging from business processes to crisis detection. For example, a government system to detect bioterror attacks must correlate multiple streams of possibly low-confidence data from sensors and local and national public health information networks with cues from indicators such as news and government sources indicating geographical locations, tactics and timing of possible attacks. The results of this correlation trigger appropriate responses, such as flagging information for more in-depth analysis or sending alerts to public health officials. Monitoring and correlation applications of this type are ideal for deployment on distributed computing grids, because they have high transaction throughput, require low latency, and can be partitioned into sets of small communicating computations with regular communication patterns. An important consideration in these applications is the need to ensure that, at any given time, computations are carried out on an accurate - or at least close to accurate - picture of the environment being monitored. One way of doing this, which we call snapshot processing, is to treat collections of events that occur at approximately the same time as representing a global snapshot - a valid state - of the environment. Computation on the resulting series of snapshots is much like computation on a real-time video of the entire environment. We briefly describe our model for these stream processing computations and introduce the concept of snapshot processing
  • Keywords
    data analysis; grid computing; media streaming; bioterror attacks; business processes; crisis detection; distributed computing grids; geographical locations; global snapshot; government system; public health information networks; real-time video; snapshot processing; stream processing computations; streaming data; streaming environments; Biosensors; Delay; Distributed computing; Information analysis; Local government; Monitoring; Public healthcare; Sensor systems; Throughput; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Grid Computing, 7th IEEE/ACM International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    1-4244-0343-X
  • Electronic_ISBN
    1-4244-0344-8
  • Type

    conf

  • DOI
    10.1109/ICGRID.2006.311038
  • Filename
    4100495