DocumentCode
1691833
Title
A stratified sampling algorithm for landmark windows over data streams
Author
Zhao, Guangyuan ; Zhang, Longbo ; Wang, Fengying ; Li, Caihong ; Wang, Yong
Author_Institution
Sch. of Comput. Sci., Shandong Univ. of Technol., Zibo, China
fYear
2010
Firstpage
2817
Lastpage
2822
Abstract
In many applications, data does not take the form of traditional stored relations, but rather arrives in continuous, rapid, time-varying data streams,and data streams are potentially unbounded in size. Focusing on the problem of sampling from landmark windows over data streams, a new concept, which is called stratified sampling ratio function, is presented. Then a multistage stratified sampling algorithm for landmark window model is introduced. In the algorithm, a dynamic candidate sample set is maintained. When an arrived tuple is determined to enter the sample set and to be deleted from the sample, the arrival time of data items is considered, and the probability for selecting to enter and remain in the sample set of more recent arrived tuples is greater than that of older ones. The theoretic analysis and experiments show that the algorithm is effective and efficient for continuous data streams processing.
Keywords
data handling; landmark windows; stratified sampling ratio function; time-varying data streams; Algorithm design and analysis; Computer science; Educational institutions; Focusing; Heuristic algorithms; Intelligent control; Medical services; data stream; landmark window; stratified sampling algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control and Automation (WCICA), 2010 8th World Congress on
Conference_Location
Jinan
Print_ISBN
978-1-4244-6712-9
Type
conf
DOI
10.1109/WCICA.2010.5554610
Filename
5554610
Link To Document