DocumentCode
2844762
Title
Probability Density Estimation over evolving data streams using Tilted Parzen Window
Author
Hong Shen ; Xiao-Long Yan
Author_Institution
Dept. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei
fYear
2008
fDate
6-9 July 2008
Firstpage
585
Lastpage
589
Abstract
Probability density estimation is a very important technology which has been widely used in data mining and data analysis. In this paper, we generalize the traditional Parzen window method to data streams and propose a new method of tilted Parzen window (TPW) for probability density estimation. To adapt to the evolvement of the data streams, we use the tilted window size that is proportional to datapsilas arrival time instead of the fixed window size. Theoretical analysis shows that the tilted Parzen window method is a valid method for estimating the probability density function (pdf) for data streams. We also propose a new strategy for discarding the historical data in data streams. We prove that this strategy can describe the probability density changes more accurately than the conventional discarding strategy. Empirical results on synthetic data set demonstrate the effectiveness and efficiency of this method.
Keywords
data analysis; probability; data streams; probability density estimation; probability density function; synthetic data set; tilted Parzen window method; Australia; Data analysis; Data mining; Hard disks; Merging; Probability density function; Real time systems; Streaming media; Telephony; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Computers and Communications, 2008. ISCC 2008. IEEE Symposium on
Conference_Location
Marrakech
ISSN
1530-1346
Print_ISBN
978-1-4244-2702-4
Electronic_ISBN
1530-1346
Type
conf
DOI
10.1109/ISCC.2008.4625751
Filename
4625751
Link To Document