• DocumentCode
    1175651
  • Title

    Mining Web pages for data records

  • Author

    Liu, Bing ; Grossman, Robert ; Zhai, Yanhong

  • Author_Institution
    Illinois Univ., Chicago, IL, USA
  • Volume
    19
  • Issue
    6
  • fYear
    2004
  • Firstpage
    49
  • Lastpage
    55
  • Abstract
    Data mining to extract information from Web pages can help provide value-added services. The MDR (mining data records) system exploits Web page structure and uses a string-matching algorithm to mine contiguous and noncontiguous data records.
  • Keywords
    Internet; Web sites; data mining; information retrieval; records management; string matching; MDR; Web page structure; data mining; mining data records; string-matching algorithm; value-added services; Data mining; Databases; HTML; Humans; Intelligent systems; Ontologies; Supervised learning; Training data; Web pages; Web data; Web data extraction; Web mining; data mining; databases;
  • fLanguage
    English
  • Journal_Title
    Intelligent Systems, IEEE
  • Publisher
    ieee
  • ISSN
    1541-1672
  • Type

    jour

  • DOI
    10.1109/MIS.2004.68
  • Filename
    1363734