• DocumentCode
    3671502
  • Title

    A conceptual design of a web information extraction and data analysis learning framework

  • Author

    Chun-Hsiung Tseng;Yung-Hui Chen;Yan-Ru Jiang

  • Author_Institution
    Department of Information Management, Nanhua University, Chiayi County, R.O.C.
  • fYear
    2015
  • Firstpage
    124
  • Lastpage
    127
  • Abstract
    The Web is flooded with data, but there is a wide gap between data and information. Collection, normalization, and analysis are the steps required to transform data into information. HTML, however, is document-centric rather than data-centric. Extracting large amounts of data from the Web is a time-consuming and tedious task, and information technologies can provide only limited help, especially when users lack domain knowledge. In this research, the conceptual design of a Web information extraction and data analysis framework is proposed. The framework guides data analysts through the required steps. Furthermore, the design is suitable for inexperienced beginners in the data analysis field, since it includes several assistant modules.
  • Keywords
    "Data mining","Web pages","Grammar","Software agents","Data analysis"
  • Publisher
    IEEE
  • Conference_Titel
    Ubi-Media Computing (UMEDIA), 2015 8th International Conference on
  • Type

    conf

  • DOI
    10.1109/UMEDIA.2015.7297441
  • Filename
    7297441