• DocumentCode
    1691833
  • Title

    A stratified sampling algorithm for landmark windows over data streams

  • Author

    Zhao, Guangyuan ; Zhang, Longbo ; Wang, Fengying ; Li, Caihong ; Wang, Yong

  • Author_Institution
    Sch. of Comput. Sci., Shandong Univ. of Technol., Zibo, China
  • fYear
    2010
  • Firstpage
    2817
  • Lastpage
    2822
  • Abstract
    In many applications, data does not take the form of traditional stored relations but instead arrives in continuous, rapid, time-varying data streams that are potentially unbounded in size. Focusing on the problem of sampling from landmark windows over data streams, a new concept, the stratified sampling ratio function, is presented. A multistage stratified sampling algorithm for the landmark window model is then introduced. The algorithm maintains a dynamic candidate sample set. When the algorithm decides whether a tuple enters the sample set or is deleted from it, the arrival time of data items is considered, so that more recently arrived tuples have a greater probability of entering and remaining in the sample set than older ones. Theoretical analysis and experiments show that the algorithm is effective and efficient for continuous data stream processing.
  • Keywords
    data handling; landmark windows; stratified sampling ratio function; time-varying data streams; Algorithm design and analysis; Computer science; Educational institutions; Focusing; Heuristic algorithms; Intelligent control; Medical services; data stream; landmark window; stratified sampling algorithm
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Control and Automation (WCICA), 2010 8th World Congress on
  • Conference_Location
    Jinan
  • Print_ISBN
    978-1-4244-6712-9
  • Type

    conf

  • DOI
    10.1109/WCICA.2010.5554610
  • Filename
    5554610