مرکز منطقه ای اطلاع رساني علوم و فناوري - Web information extraction and its application

DocumentCode :

1813577

Title :

Web information extraction and its application

Author :

Peng, Yan ; Zhang, Chenyue

Author_Institution :

Sch. of Manage., Capital Normal Univ., Beijing, China

fYear :

2011

fDate :

15-17 Sept. 2011

Firstpage :

448

Lastpage :

451

Abstract :

Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. The work presented in this paper described an approach of design an information extraction system; put forward basic system architecture. Describe the detail steps of web information extraction, such as web page organize, rule generate and result show. Finally, successfully extracted information is placed in an XML template, which has been designed to capture information needed in the teaching-learning system. Although the work presented in this paper was restricted to HTML course outlines, the concepts and methods are easily applied to other different domains.

Keywords :

Internet; XML; computer aided instruction; hypermedia markup languages; teaching; HTML course outlines; Web information extraction; Web page; XML template; basic system architecture; teaching-learning system; Data mining; Databases; HTML; Service oriented architecture; Web pages; XML; Extraction Rule; HTML; Information Extraction; XML;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cloud Computing and Intelligence Systems (CCIS), 2011 IEEE International Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-61284-203-5

Type :

conf

DOI :

10.1109/CCIS.2011.6045107

Filename :

6045107

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1813577