• DocumentCode
    2076798
  • Title

    An information theoretic similarity-based learning method for databases

  • Author

    Lee, Changhwan

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Connecticut Univ., Storrs, CT, USA
  • fYear
    1994
  • fDate
    1-4 Mar 1994
  • Firstpage
    99
  • Lastpage
    105
  • Abstract
    Similarity-based learning has been widely and successfully used in some domains. Despite these successes, most similarity measures used in the current literature are defined on limited feature types. Therefore, these similarity measures cannot be applied to the database environment due to the variety of data types that exist. In this paper, we propose a new method of similarity-based learning for databases using information theory. The current similarity measures are improved in several ways. Similarity is defined on every attribute type in the database, and each attribute is assigned a weight depending on its importance with respect to the target attribute. Besides, our nearest neighbor algorithm gives different weights to the selected instances. Our system is implemented and tested on some typical machine learning databases. Our experiments show that the classification accuracy of our system is, in general, superior to that of other learning methods
  • Keywords
    deductive databases; information theory; learning (artificial intelligence); attribute type; classification accuracy; data types; database environment; information theoretic similarity-based learning method; machine learning databases; nearest neighbor algorithm; similarity measures; weights; Clustering algorithms; Computer science; Current measurement; Data engineering; Information theory; Learning systems; Machine learning; Machine learning algorithms; Performance analysis; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on
  • Conference_Location
    San Antonia, TX
  • Print_ISBN
    0-8186-5550-X
  • Type

    conf

  • DOI
    10.1109/CAIA.1994.323686
  • Filename
    323686