DocumentCode
2076798
Title
An information theoretic similarity-based learning method for databases
Author
Lee, Changhwan
Author_Institution
Dept. of Comput. Sci. & Eng., Connecticut Univ., Storrs, CT, USA
fYear
1994
fDate
1-4 Mar 1994
Firstpage
99
Lastpage
105
Abstract
Similarity-based learning has been widely and successfully used in some domains. Despite these successes, most similarity measures used in the current literature are defined on limited feature types. Therefore, these similarity measures cannot be applied to the database environment due to the variety of data types that exist. In this paper, we propose a new method of similarity-based learning for databases using information theory. The current similarity measures are improved in several ways. Similarity is defined on every attribute type in the database, and each attribute is assigned a weight depending on its importance with respect to the target attribute. Besides, our nearest neighbor algorithm gives different weights to the selected instances. Our system is implemented and tested on some typical machine learning databases. Our experiments show that the classification accuracy of our system is, in general, superior to that of other learning methods
Keywords
deductive databases; information theory; learning (artificial intelligence); attribute type; classification accuracy; data types; database environment; information theoretic similarity-based learning method; machine learning databases; nearest neighbor algorithm; similarity measures; weights; Clustering algorithms; Computer science; Current measurement; Data engineering; Information theory; Learning systems; Machine learning; Machine learning algorithms; Performance analysis; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on
Conference_Location
San Antonia, TX
Print_ISBN
0-8186-5550-X
Type
conf
DOI
10.1109/CAIA.1994.323686
Filename
323686
Link To Document