Title :
A Heuristic-Based Co-clustering Algorithm for the Internet Traffic Classification
Author :
Wei Lu ; Ling Xue
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Victoria, Victoria, BC, Canada
Abstract :
Real-time classification of network traffic on large-scale communication networks has been studied extensively in recent years because of its importance to network security, QoS provisioning, and network management. Many existing traffic classification tools rely on port numbers and packet payload signatures. These techniques, however, are far from complete, owing for example to the growing number of new Internet applications and to traffic encryption. In this paper, we propose a hybrid framework for classifying Internet traffic that combines a classifier based on well-known port numbers and packet payload signatures with a novel heuristic-based co-clustering algorithm for the leftover unknown traffic. By using a fast unsupervised co-clustering algorithm with simple flow-based features, our traffic classifier can discover applications on the Internet online and in real time. Experimental evaluations with over 200,000 network flows collected over two consecutive days on a large-scale WiFi ISP show that the proposed approach successfully classifies a large portion of the Internet traffic missed by the signature-based classifier while also reducing the false alarm rate.
Keywords :
Internet; cryptography; digital signatures; learning (artificial intelligence); pattern clustering; telecommunication security; telecommunication traffic; Internet applications; Internet traffic classification; application discovery; false alarm rate; flow-based features; heuristic-based coclustering algorithm; large-scale WiFi ISP; large-scale communication networks; machine learning; network flows; network traffic classification; packet payload signatures; port numbers; real-time computing online; traffic classification tools; traffic classifier; traffic encryption; Classification algorithms; Communities; Heuristic algorithms; IP networks; Payloads; Ports (Computers)
Conference_Title :
Advanced Information Networking and Applications Workshops (WAINA), 2014 28th International Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4799-2652-7
DOI :
10.1109/WAINA.2014.16