Title :
Threshold Sampling for Network Streaming Data Analysis
Author :
Zhang, Hai ; Yang, Zhuxu ; Guo, Wenming
Author_Institution :
Sch. of Comput. Sci. Eng., South China Univ. of Technol., Guangzhou
Abstract :
Network streaming data are the traffic records generated on high-speed network links. They arrive continuously and in huge volumes. The key to analyzing network streaming data is to construct a smaller yet well-organized subset of the data that retains the most important information and can quickly answer a specific type of query. In this paper, we propose a threshold sampling algorithm for network streaming data analysis. With threshold sampling, the analysis can focus on large traffic without neglecting small traffic. Moreover, the algorithm is evaluated on the task of picking out frequent items in order to detect super sources and destinations in network streaming data. Comparing threshold sampling with traditional sampling methods, we conclude that the proposed method offers better self-adaptability and better control over resource consumption without sacrificing accuracy.
Keywords :
computer networks; data analysis; database management systems; query processing; sampling methods; telecommunication traffic; high-speed network link; network streaming data analysis; network traffic; query answering; resource consumption; threshold sampling algorithm; Biomedical engineering; Communication system traffic control; Computer networks; Computer science; Data analysis; Data engineering; Monitoring; Sampling methods; Statistics; Telecommunication traffic; Internet measurement; Sampling;
Conference_Title :
Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3489-3
DOI :
10.1109/ICACTE.2008.109