• DocumentCode
    1470418
  • Title

    An empirical study of domain knowledge and its benefits to substructure discovery

  • Author

    Djoko, Surnjani ; Cook, Diane J. ; Holder, Lawrence B.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Texas Univ., Arlington, TX, USA
  • Volume
    9
  • Issue
    4
  • fYear
    1997
  • Firstpage
    575
  • Lastpage
    586
  • Abstract
    Discovering repetitive, interesting, and functional substructures in a structural database improves the ability to interpret and compress the data. However, scientists working with a database in their area of expertise often search for predetermined types of structures or for structures exhibiting characteristics specific to the domain. The paper presents a method for guiding the discovery process with domain specific knowledge. The SUBDUE discovery system is used to evaluate the benefits of using domain knowledge to guide the discovery process. Domain knowledge is incorporated into SUBDUE following a single general methodology to guide the discovery process. Results show that domain specific knowledge improves the search for substructures that are useful to the domain and leads to greater compression of the data. To illustrate these benefits, examples and experiments from the computer programming, computer aided design circuit, and artificially generated domains are presented
  • Keywords
    data compression; data structures; deductive databases; knowledge acquisition; query processing; SUBDUE discovery system; artificially generated domains; computer aided design circuit; computer programming; data compression; domain specific knowledge; functional substructures; knowledge discovery process; predetermined structures; structural database; substructure discovery; Acceleration; Circuits; Data analysis; Data compression; Design automation; Design methodology; Process design; Programming; Transaction databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.617051
  • Filename
    617051