DocumentCode :
2772820
Title :
PUB: A Class Description Technique Based on Partial Coverage of Subspace
Author :
Poernomo, Ardian Kristanto ; Gopalkrishnan, Vivekanand
Author_Institution :
Nanyang Technol. Univ., Singapore, Singapore
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
920
Lastpage :
925
Abstract :
A good description of a class should be accurate and interpretable. Previous works describe classes either by analyzing the correlation of each attribute with the class, or by producing rules as in building a classifier. These solutions suffer from issues in accuracy and interpretability. A description naturally consists of sentences, where each sentence consists of a set of terms. Normally, a sentence is defined as a disjunction or conjunction of several terms, each of which specifies a constraint (range/set of values) on an attribute. From the data analysis point of view, a sentence specifies a subspace in the database. In this paper, we create a richer yet interpretable form of a sentence, i.e., a sentence describes an object if any $k$ attributes of that object satisfy the specified constraints. To that end, we design textsc{Pub}, an algorithm that produces descriptions with our form of sentences. While constructing a sentence (within the description), textsc{Pub} finds the optimal range/set of values for each attribute in linear time. We also empirically show that textsc{Pub} is efficient, and able to produce more accurate, concise and interpretable descriptions than current approaches on various real datasets.
Keywords :
data mining; pattern classification; PUB algorithm; class description technique; data analysis; sentence construction; sentences interpretable form; subspace partial coverage; Algorithm design and analysis; Animals; Classification tree analysis; Dairy products; Data analysis; Data mining; Databases; Decision trees; Iris; Performance analysis; class description; classification; fault-tolerant pattern; frequent pattern mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.97
Filename :
5360334
Link To Document :
بازگشت