Title :
Mining Web pages for data records
Author :
Liu, Bing ; Grossman, Robert ; Zhai, Yanhong
Author_Institution :
Univ. of Illinois at Chicago, Chicago, IL, USA
Abstract :
Data mining to extract information from Web pages can help provide value-added services. The MDR (mining data records) system exploits Web page structure and uses a string-matching algorithm to mine contiguous and noncontiguous data records.
Keywords :
Internet; Web sites; data mining; information retrieval; records management; string matching; MDR; Web page structure; mining data records; string-matching algorithm; value-added services; Databases; HTML; Humans; Intelligent systems; Ontologies; Supervised learning; Training data; Web pages; Web data; Web data extraction; Web mining
Journal_Title :
IEEE Intelligent Systems
DOI :
10.1109/MIS.2004.68