DocumentCode :
1203253
Title :
A conceptual clustering algorithm for database schema design
Author :
Beck, Howard W. ; Anwar, Tarek ; Navathe, Shamkant B.
Author_Institution :
Dept. of Agric. Eng., Florida Univ., Gainesville, FL, USA
Volume :
6
Issue :
3
fYear :
1994
fDate :
6/1/1994 12:00:00 AM
Firstpage :
396
Lastpage :
411
Abstract :
Conceptual clustering techniques based on current theories of categorization provide a way to design database schemas that more accurately represent classes. An approach is presented in which classes are treated as complex clusters of concepts rather than as simple predicates. An important service provided by the database is determining whether a particular instance is a member of a class. A conceptual clustering algorithm based on theories of categorization aids in building classes by grouping related instances and developing class descriptions. The resulting database schema addresses a number of properties of categories, including default values and prototypes, analogical reasoning, exception handling, and family resemblance. Class cohesion results from trying to resolve conflicts between building generalized class descriptions and accommodating members of the class that deviate from these descriptions. This is achieved by combining techniques from machine learning, specifically explanation-based learning and case-based reasoning. A subsumption function is used to compare two class descriptions. A realization function is used to determine whether an instance meets an existing class description. A new function, INTERSECT, is introduced to compare the similarity of two instances. INTERSECT is used in defining an exception condition. Exception handling results in schema modification. This approach is applied to the database problems of schema integration, schema generation, query processing, and view creation
Keywords :
case-based reasoning; data structures; database management systems; database theory; exception handling; learning (artificial intelligence); operating systems (computers); query processing; INTERSECT; analogical reasoning; case-based reasoning; categorization; class cohesion; class descriptions; complex clusters; conceptual clustering algorithm; database schema design; default values; exception condition; exception handling; explanation-based learning; family resemblance; machine learning; query processing; realization function; schema generation; schema integration; schema modification; subsumption function; view creation; Algorithm design and analysis; Clustering algorithms; Councils; Data models; Databases; Industrial relations; Machine learning; Prototypes; Query processing; Taxonomy;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.334862
Filename :
334862
Link To Document :
بازگشت