DocumentCode
2768303
Title
SOA-Based Integration of Text Mining Services
Author
Starlinger, Johannes ; Leitner, Florian ; Valencia, Alfonso ; Leser, Ulf
Author_Institution
Humboldt-Univ. zu Berlin, Berlin, Germany
fYear
2009
fDate
6-10 July 2009
Firstpage
99
Lastpage
106
Abstract
Text Mining has established itself as a valuable tool for knowledge extraction in many commercial and scientific areas. Accordingly, a large number of different methods have been developed focusing on a broad range of different tasks. We report on a novel system architecture that is fundamentally service-based, i.e., it models and implements text mining and knowledge extraction routines as independent, yet federated services. The system has several layers: (1) Base services perform various fundamental extraction tasks. They all implement a fixed interface but keep their particular algorithms and functionality. (2) A metaservice acting as a central access point to those base services, thus providing a homogeneous interface to different algorithms. (3) An aggregation service on top of the metaservice which implements functionality to graphically show, compare, and aggregate the results of different base services. Each layer is accessible as a Web Service and thus ready to be integrated in applications that are higher up in the value chain, such as authoring tools or systems for the automatic construction of knowledge bases. We developed our system with a focus on the mining of Life Science text collections. It is available from http://www.bc-viscon.net.
Keywords
Web services; data mining; knowledge based systems; software architecture; Life Science text collections; SOA-based integration; Web service; base services; central access point; knowledge extraction; metaservice; system architecture; text mining services; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Services - I, 2009 World Conference on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3708-5
Electronic_ISBN
978-0-7695-3708-5
Type
conf
DOI
10.1109/SERVICES-I.2009.100
Filename
5190750
Link To Document