DocumentCode
2247293
Title
An agent-based system framework for multi-slot Web information extraction
Author
Zhang, Shudong ; Qin, Ye ; Yao, Naiming
Author_Institution
Coll. of Inf. Eng., Capital Normal Univ., Beijing, China
Volume
3
fYear
2010
fDate
6-7 March 2010
Firstpage
200
Lastpage
203
Abstract
At present, the scale and diversity of Web information are immense. Relying on search engines alone to acquire Web information is increasingly unable to meet user needs, so Web information extraction (WebIE) technology has attracted wide attention. In this paper, an agent-based framework for a distributed multi-slot WebIE system is proposed. It includes a user agent, a mediator agent, a wrapper agent, a data store agent, a page preprocessing agent, and the corresponding knowledge base. The agents communicate and cooperate with each other to carry out the overall goal of the system. Moreover, for multi-slot extraction, approaches to extraction rule learning and repair are presented, which enhance the adaptability of the system.
Keywords
Internet; information retrieval; knowledge based systems; multi-agent systems; agent-based system framework; data store agent; distributed multislot WebIE system; knowledge-base system; mediator agent; multislot Web information extraction; page preprocessing agent; search engine; user agent; wrapper agent; Artificial intelligence; Asia; Automatic control; Data mining; Informatics; Internet; Robot control; Robotics and automation; Search engines; Web pages; Web information extraction; agent; distributed; extraction rule;
fLanguage
English
Publisher
IEEE
Conference_Titel
Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
Conference_Location
Wuhan
ISSN
1948-3414
Print_ISBN
978-1-4244-5192-0
Electronic_ISBN
1948-3414
Type
conf
DOI
10.1109/CAR.2010.5456664
Filename
5456664