• DocumentCode
    2037027
  • Title

    An approximation method for extracting typical classes from semistructured data

  • Author

    Suzuki, Nobutaka ; Sato, Yoichirou ; Hayase, Michiyoshi

  • Author_Institution
    Fac. of Comput. Sci. & Syst. Eng., Okayama Prefectural Univ., Japan
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    197
  • Lastpage
    200
  • Abstract
    We consider a class extraction problem over semistructured data. A class C is extracted by grouping objects having similar (not necessarily identical) sets of properties into C, where the set of properties of C is the union of those of the objects in C. Let C be an extracted class and o be an object in C. If C has property P but o has no property P value, then P is null within o. An extracted class c is called typical if the number of nulls in C is small against the number of object in C and the number of properties of C. We present the following results. First, we prove that the problem of deciding if a typical class can be extracted from given semistructured data is NP-complete. Second, we present an approximation algorithm for extracting typical classes from given semistructured data. Finally, we briefly discuss a sufficient condition for the approximation algorithm to run efficiently
  • Keywords
    data structures; database theory; approximation algorithm; class extraction; semistructured data; similar object grouping; Approximation methods; Computational Intelligence Society; Computer science; Data engineering; Data mining; Systems engineering and theory; US Department of Transportation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Applications in Non-Traditional Environments, 1999. (DANTE '99) Proceedings. 1999 International Symposium on
  • Conference_Location
    Kyoto
  • Print_ISBN
    0-7695-0496-5
  • Type

    conf

  • DOI
    10.1109/DANTE.1999.844960
  • Filename
    844960