Title :
Lightweight workflow engine based on HADOOP and OSGI
Author :
Shengmei Luo ; Lixia Liu ; Juan Yang ; Di Zhang
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
The growth of data used by data-intensive computations has out-paced the growth of the power of a single processor. In this paper, we propose a high performance and scalability workflow engine, a lightweight parallel and distributed computing platform based on HADOOP and OSGI. Workflow engine provides flexible and scalable OSGI-based interfaces which user can implement to define data processing functions and expand functions in HADOOP ecosystem. Workflow engine consists of invoking interfaces, scheduler engine, optimizer and fault-tolerance manager. Given a workflow process, the engine analyzes data dependencies among nodes, then dispatches them to Map Reduce clusters based on the current status of the system. This paper also describes the design and strategy in detail on implementation processes, key points, relationships of each participants and etc. Our experiment demonstrates that workflow engine can significantly improve the performance of workflow execution.
Keywords :
parallel programming; public domain software; software fault tolerance; user interfaces; workflow management software; Hadoop; MapReduce clusters; OSGI-based interfaces; data dependency analysis; data processing functions; data-intensive computations; distributed computing platform; fault-tolerance manager; lightweight parallel computing platform; lightweight workflow engine; optimizer; scalability workflow engine; scheduler engine; Cloud computing; Computer architecture; Containers; Engines; Fault tolerance; Fault tolerant systems; Servers; Fault-tolerance; HADOOP; OSGI; RPC; Scheduler; Workflow Engine;
Conference_Titel :
Broadband Network & Multimedia Technology (IC-BNMT), 2013 5th IEEE International Conference on
Conference_Location :
Guilin
DOI :
10.1109/ICBNMT.2013.6823954