Title :
Jedi: extracting and synthesizing information from the Web
Author :
Huck, Gerald ; Fankhauser, P. ; Aberer, Karl ; Neuhold, Erich
Author_Institution :
German Nat. Res. Center for Inf. Technol., Darmstadt, Germany
Abstract :
Jedi (Java based Extraction and Dissemination of Information) is a lightweight tool for creating wrappers and mediators that extract, combine, and reconcile information from several independent information sources. For wrappers, it uses attributed grammars, which are evaluated with a fault-tolerant parsing strategy to cope with ambiguous grammars and irregular sources. For mediation, it uses a simple generic object model that can be extended with Java libraries for specific models such as HTML, XML, or the relational model. This paper describes the architecture of Jedi and then focuses on Jedi's wrapper generator.
Keywords :
Internet; attribute grammars; information retrieval; Java based Extraction and Dissemination of Information; Jedi; attributed grammars; fault-tolerant parsing; generic object-model; information sources; mediators; relational model; wrappers; Cities and towns; Data mining; Electrical capacitance tomography; Fault tolerance; HTML; Java; Mediation; Weather forecasting; Wrapping; XML;
Conference_Title :
Cooperative Information Systems, 1998. Proceedings. 3rd IFCIS International Conference on
Conference_Location :
New York, NY, USA
Print_ISBN :
0-8186-8380-5
DOI :
10.1109/COOPIS.1998.706182