DocumentCode
1175651
Title
Mining Web pages for data records
Author
Liu, Bing ; Grossman, Robert ; Zhai, Yanhong
Author_Institution
Illinois Univ., Chicago, IL, USA
Volume
19
Issue
6
fYear
2004
Firstpage
49
Lastpage
55
Abstract
Data mining to extract information from Web pages can help provide value-added services. The MDR (mining data records) system exploits Web page structure and uses a string-matching algorithm to mine contiguous and noncontiguous data records.
Keywords
Internet; Web sites; data mining; information retrieval; records management; string matching; MDR; Web page structure; data mining; mining data records; string-matching algorithm; value-added services; Data mining; Databases; HTML; Humans; Intelligent systems; Ontologies; Supervised learning; Training data; Web pages; Web data; Web data extraction; Web mining; data mining; databases;
fLanguage
English
Journal_Title
Intelligent Systems, IEEE
Publisher
ieee
ISSN
1541-1672
Type
jour
DOI
10.1109/MIS.2004.68
Filename
1363734
Link To Document