DocumentCode
3250841
Title
Extraction techniques for mining services from Web sources
Author
Davulcu, Hasan ; Mukherjee, Saikat ; Ramakrishnan, I.V.
Author_Institution
Dept. of Comput. Sci., State Univ. of New York, Stony Brook, NY, USA
fYear
2002
fDate
2002
Firstpage
601
Lastpage
604
Abstract
The Web has established itself as the dominant medium for doing electronic commerce. Consequently the number of service providers, both large and small, advertising their services on the web continues to proliferate. In this paper we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results of the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.
Keywords
data mining; electronic commerce; learning (artificial intelligence); electronic commerce; extraction algorithms; mining service directories; web pages; web sites; Advertising; Cities and towns; Computer science; Databases; Electronic commerce; Ontologies; Taxonomy; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-1754-4
Type
conf
DOI
10.1109/ICDM.2002.1184008
Filename
1184008
Link To Document