• DocumentCode
    2816622
  • Title

    Confidence on approximate query in large datasets

  • Author

    Ford, Charles Wesley ; Chiang, Chia-Chu ; Wu, Hao ; Chilka, Radhika R. ; Talburt, John

  • Author_Institution
    Dept. of Comput. Sci., Arkansas Univ., Little Rock, AR, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    5-7 April 2004
  • Firstpage
    480
  • Abstract
    The evolution of the World Wide Web has brought us enormous amounts of information for business and research use. Design and implementation of an automated system for Web data mining has become important for companies wishing to utilize useful information from the Web. We attempt to describe confidence on approximate queries on large datasets, which is done in the context of an automated system for Web data mining. The system has been designed to identify, extract, filter, and analyze data from Web resources. An approach to evaluating the quality of extracted Web data is also discussed. This is an exploratory study of Web data retrieval and Web data analysis.
  • Keywords
    Internet; Web sites; data analysis; data mining; information filters; query processing; very large databases; Internet; Web data analysis; Web data extraction; Web data filtering; Web data identification; Web data mining; World Wide Web; approximate query confidence; automated system design; large datasets; Application software; Data analysis; Data mining; Databases; Information filtering; Information filters; Information retrieval; Search engines; Statistics; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
  • Print_ISBN
    0-7695-2108-8
  • Type

    conf

  • DOI
    10.1109/ITCC.2004.1286700
  • Filename
    1286700