DocumentCode :
2728781
Title :
Leveraging Webpage Classification for Data Object Recognition
Author :
Lin, Ling ; Zhou, Lizhu
Author_Institution :
Tsinghua Univ., Beijing
fYear :
2007
fDate :
2-5 Nov. 2007
Firstpage :
667
Lastpage :
670
Abstract :
Data-rich webpages are providing an increasingly important data source for web applications. While the problem of data object recognition is intensively discussed, it is mostly addressed as a separated process from the frontier task of relevant webpage identification. In this paper, we propose a method to leverage the classification result of data-rich webpages for efficient and scalable data object recognition. A novel context information is proposed, which can be inferred from the webpage classification and exploited in the bottom-up data object recognition. Experimental results show that the context information brings a 19% improvement in the running efficiency of the bottom- up data object recognition.
Keywords :
Web sites; pattern recognition; Web applications; Web page classification; Web page identification; context information; data object recognition; data-rich Web pages; Data mining; Decision trees; Labeling; Object recognition; Search engines; Software libraries; Tellurium; Utility programs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3026-0
Type :
conf
DOI :
10.1109/WI.2007.48
Filename :
4427170
Link To Document :
بازگشت