• DocumentCode
    81691
  • Title

    Synoptic Graphlet: Bridging the Gap Between Supervised and Unsupervised Profiling of Host-Level Network Traffic

  • Author

    Himura, Yosuke ; Fukuda, Kenji ; Cho, Kun ; Borgnat, Pierre ; Abry, Patrice ; Esaki, Hiroshi

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • Volume
    21
  • Issue
    4
  • fYear
    2013
  • fDate
    Aug. 2013
  • Firstpage
    1284
  • Lastpage
    1297
  • Abstract
    End-host profiling by analyzing network traffic comes out as a major stake in traffic engineering. Graphlet constitutes an efficient and common framework for interpreting host behaviors, which essentially consists of a visual representation as a graph. However, graphlet analyses face the issues of choosing between supervised and unsupervised approaches. The former can analyze a priori defined behaviors but is blind to undefined classes, while the latter can discover new behaviors at the cost of difficult a posteriori interpretation. This paper aims at bridging the gap between the two. First, to handle unknown classes, unsupervised clustering is originally revisited by extracting a set of graphlet-inspired attributes for each host. Second, to recover interpretability for each resulting cluster, a synoptic graphlet, defined as a visual graphlet obtained by mapping from a cluster, is newly developed. Comparisons against supervised graphlet-based, port-based, and payload-based classifiers with two datasets demonstrate the effectiveness of the unsupervised clustering of graphlets and the relevance of the a posteriori interpretation through synoptic graphlets. This development is further complemented by studying evolutionary tree of synoptic graphlets, which quantifies the growth of graphlets when increasing the number of inspected packets per host.
  • Keywords
    Internet; graph theory; telecommunication traffic; Internet traffic; graph visual representation; graphlet based classifiers; host level network traffic; inspected packets; payload based classifiers; port based classifiers; supervised profiling; synoptic graphlet; traffic engineering; unsupervised clustering; unsupervised profiling; Feature extraction; Payloads; Peer to peer computing; Shape; Vectors; Visualization; Internet traffic analysis; microscopic graph evolution; unsupervised host profiling; visualization;
  • fLanguage
    English
  • Journal_Title
    Networking, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6692
  • Type

    jour

  • DOI
    10.1109/TNET.2012.2226603
  • Filename
    6365779