DocumentCode :
2864952
Title :
Parameter-free spatial data mining using MDL
Author :
Papadimitriou, Spiros ; Gionis, Aristides ; Tsaparas, Panayiotis ; Väisänen, Risto A. ; Mannila, Heikki ; Faloutsos, Christos
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
Consider spatial data consisting of a set of binary features taking values over a collection of spatial extents (grid cells). We propose a method that simultaneously finds spatial correlation and feature co-occurrence patterns, without any parameters. In particular, we employ the minimum description length (MDL) principle coupled with a natural way of compressing regions. This defines what "good" means: a feature co-occurrence pattern is good, if it helps us better compress the set of locations for these features. Conversely, a spatial correlation is good, if it helps us better compress the set of features in the corresponding region. Our approach is scalable for large datasets (both number of locations and of features). We evaluate our method on both real and synthetic datasets.
Keywords :
data mining; visual databases; feature cooccurrence patterns; large datasets; minimum description length principle; parameter-free spatial data mining; spatial correlation; Bioinformatics; Biological materials; Character generation; Cities and towns; Data mining; Hospitals; Hurricanes; NASA; Space technology; Storms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.117
Filename :
1565698
Link To Document :
بازگشت