• DocumentCode
    166407
  • Title

    A Heuristic-Based Co-clustering Algorithm for the Internet Traffic Classification

  • Author

    Wei Lu ; Ling Xue

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Victoria, Victoria, BC, Canada
  • fYear
    2014
  • fDate
    13-16 May 2014
  • Firstpage
    49
  • Lastpage
    54
  • Abstract
    Classifying network traffic in a real-time fashion on large-scale communication networks has been extensively studied in recent years due to its importance in many areas such as network security, QoS provisioning, and network management. To address this issue, port numbers and packet payload signatures have been widely used in many existing traffic classification tools. They, however, are far away from completed due to for example the increase of new Internet applications and traffic encryption. In this paper, we propose a hybrid framework to classify the Internet traffic, combining a classifier based on the well-known port numbers and packet payload signatures, and a novel heuristic-based co-clustering algorithm for classifying the leftover unknown Internet traffic. Taking advantage of a fast unsupervised co-clustering algorithm with simple flow-based features, our traffic classifier can perform a real-time computing online for application discovery on the Internet. Experimental evaluations with over 200,000 network flows collected over two consecutive days on a large-scale WiFi ISP show that the proposed approach successfully classifies a large portion of the Internet traffic missed by the signature based classifier while also reducing the false alarm rate.
  • Keywords
    Internet; cryptography; digital signatures; learning (artificial intelligence); pattern clustering; telecommunication security; telecommunication traffic; Internet applications; Internet traffic classification; application discovery; false alarm rate; flow-based features; heuristic-based coclustering algorithm; large-scale WiFi ISP; large-scale communication networks; machine learning; network flows; network traffic classification; packet payload signatures; port numbers; real-time computing online; traffic classification tools; traffic classifier; traffic encryption; Classification algorithms; Communities; Heuristic algorithms; IP networks; Internet; Payloads; Ports (Computers); Internet traffic classification; machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops (WAINA), 2014 28th International Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    978-1-4799-2652-7
  • Type

    conf

  • DOI
    10.1109/WAINA.2014.16
  • Filename
    6844612