DocumentCode :
3076878
Title :
Cowic: A Column-Wise Independent Compression for Log Stream Analysis
Author :
Hao Lin ; Jingyu Zhou ; Bin Yao ; Minyi Guo ; Jie Li
Author_Institution :
Sch. of Software, Shanghai Jiao Tong Univ., Shanghai, China
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
21
Lastpage :
30
Abstract :
Nowadays massive log streams are generated from many Internet and cloud services. Storing log streams consumes a large amount of disk space and incurs high cost. Traditional compression methods can be applied to reduce storage cost, but are inefficient for log analysis, because fetching relevant log entries from compressed data often requires retrieval and decompression of large blocks of data. We propose a column-wise compression approach for well-formatted log streams, where each log entry can be independently compressed or decompressed for analysis. Specifically, we separate a log entry into several columns and compress each column with different models. We have implemented our approach as a library and integrated it into two applications, a log search system and a log joining system. Experimental results show that our compression scheme outperforms traditional compression methods for decompression times and has a competitive compression ratio. For log search, our approach achieves better query times than using traditional compression algorithms for both in-core and out-of-core cases. For joining log streams, our approach achieves the same join quality with only 30% memory of uncompressed streams.
Keywords :
Web services; cloud computing; data analysis; data compression; Cowic; Internet; cloud service; column-wise independent compression; log joining system; log search system; log stream analysis; Adaptation models; Compression algorithms; Data models; Dictionaries; Indexes; Libraries; Training; Log Joining; Log Search; Log Stream Compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.45
Filename :
7152468
Link To Document :
بازگشت