Title :
A Parallel Workflow for Real-time Correlation and Clustering of High-Frequency Stock Market Data
Author :
Rostoker, Camilo ; Wagner, Alan ; Hoos, Holger
Author_Institution :
Dept. of Comput. Sci., British Columbia Univ., Vancouver, BC
Abstract :
We investigate the design and implementation of a parallel workflow environment targeted towards the financial industry. The system performs real-time correlation analysis and clustering to identify trends within streaming high-frequency intra-day trading data. Our system utilizes state-of-the-art methods to optimize the delivery of computationally-expensive real-time stock market data analysis, with direct applications in automated/algorithmic trading as well as knowledge discovery in high-throughput electronic exchanges. This paper describes the design of the system including the key online parallel algorithms for robust correlation calculation and clique-based clustering using stochastic local search. We evaluate the performance and scalability of the system, followed by a preliminary analysis of the results using data from the Toronto Stock Exchange.
Keywords :
data mining; parallel algorithms; pattern clustering; stochastic processes; stock markets; workflow management software; automated trading; computationally-expensive real-time stock market data analysis; financial industry; high-frequency intra-day trading data; high-throughput electronic exchanges; knowledge discovery; online parallel algorithms; parallel workflow environment; real-time clique-based clustering; real-time correlation analysis; state-of-the-art methods; stochastic local search; Algorithm design and analysis; Clustering algorithms; Consumer electronics; Data analysis; Optimization methods; Parallel algorithms; Performance analysis; Real time systems; Robustness; Stock markets;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
DOI :
10.1109/IPDPS.2007.370216