• DocumentCode
    3250841
  • Title

    Extraction techniques for mining services from Web sources

  • Author

    Davulcu, Hasan ; Mukherjee, Saikat ; Ramakrishnan, I.V.

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Stony Brook, NY, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    601
  • Lastpage
    604
  • Abstract
    The Web has established itself as the dominant medium for doing electronic commerce. Consequently the number of service providers, both large and small, advertising their services on the web continues to proliferate. In this paper we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results of the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.
  • Keywords
    data mining; electronic commerce; learning (artificial intelligence); electronic commerce; extraction algorithms; mining service directories; web pages; web sites; Advertising; Cities and towns; Computer science; Databases; Electronic commerce; Ontologies; Taxonomy; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1184008
  • Filename
    1184008