Title :
Clustering for Complex Structured Data Based on Higher-Order Logic
Author :
Li, Linna ; Yang, Bingru ; Zhang, Fan
Author_Institution :
Sch. of Inf. Eng., Univ. of Sci. & Technol. Beijing, Beijing
Abstract :
Data clustering is an important technique for exploratory data analysis, and has been studied for many years. The existing clustering methods are all designed in attribute-value setting or first-order logic setting. However, attribute-value language can not describe complex structured data. First-order logic can represent certain complex structured data, but both scalability and efficiency of clustering algorithms in this setting are questionable because they need vast scans of data. This paper presents clustering for complex structured data based on higher-order logic. Data is represented by Escher, which is a typed, higher-order logic language. K-means algorithm is investigated with it. Experimental results demonstrate that clustering algorithm which adopts Escher have higher efficiency and better scalability.
Keywords :
computational linguistics; data analysis; data mining; formal logic; pattern clustering; attribute-value language; complex structured data clustering; data mining; data representation; exploratory data analysis; first-order logic; higher-order logic language; Clustering algorithms; Computer science; Data engineering; Data mining; Design methodology; Logic design; Logic programming; Relational databases; Scalability; Software engineering; clustering; complex structured data; higher-order logic;
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
DOI :
10.1109/CSSE.2008.1031