DocumentCode
3298974
Title
Enumeration of maximal clique for mining spatial co-location patterns
Author
Al-Naymat, Ghazi
Author_Institution
Univ. of Sydney, Sydney
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
126
Lastpage
133
Abstract
This paper presents a systematic approach to mine co- location patterns in Sloan Digital Sky Survey (SDSS) data. SDSS Data Release 5 (DR5) contains 3.6 TB of data. Availability of such large amount of useful data is an opportunity for application of data mining techniques to generate interesting information. The major reason for the lack of such data mining applications in SDSS is the unavailability of data in a suitable format. This work illustrates a procedure to obtain additional galaxy types from an available attributes and transform the data into maximal cliques of galaxies which in turn can be used as transactions for data mining applications. An efficient algorithm GridClique is proposed to generate maximal cliques from large spatial databases. It should be noted that the full general problem of extracting a maximal clique from a graph is known as NP-Hard. The experimental results show that the GridClique algorithm successfully generates all maximal cliques in the SDSS data and enables the generation of useful co-location patterns.
Keywords
data mining; grid computing; very large databases; GridClique algorithm; data mining techniques; large spatial databases; maximal clique enumeration; sloan digital sky survey data; spatial colocation patterns mining; Astronomy; Australia; Brightness; Catalogs; Data mining; Focusing; Information technology; Mesh generation; Spatial databases; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
Conference_Location
Doha
Print_ISBN
978-1-4244-1967-8
Electronic_ISBN
978-1-4244-1968-5
Type
conf
DOI
10.1109/AICCSA.2008.4493526
Filename
4493526
Link To Document