• DocumentCode
    621173
  • Title

    Self-Learning Classifier for Internet traffic

  • Author

    Grimaudo, Luigi ; Mellia, Marco ; Baralis, Elena ; Keralapura, Ram

  • Author_Institution
    Politec. di Torino, Turin, Italy
  • fYear
    2013
  • fDate
    14-19 April 2013
  • Firstpage
    423
  • Lastpage
    428
  • Abstract
    Network visibility is a critical part of traffic engineering, network management, and security. Recently, unsupervised algorithms have been envisioned as a viable alternative to automatically identify classes of traffic. However, the accuracy achieved so far does not allow to use them for traffic classification in practical scenario. In this paper, we propose SeLeCT, a Self-Learning Classifier for Internet traffic. It uses unsupervised algorithms along with an adaptive learning approach to automatically let classes of traffic emerge, being identified and (easily) labeled. SeLeCT automatically groups flows into pure (or homogeneous) clusters using alternating simple clustering and filtering phases to remove outliers. SeLeCT uses an adaptive learning approach to boost its ability to spot new protocols and applications. Finally, SeLeCT also simplifies label assignment (which is still based on some manual intervention) so that proper class labels can be easily discovered. We evaluate the performance of SeLeCT using traffic traces collected in different years from various ISPs located in 3 different continents. Our experiments show that SeLeCT achieves overall accuracy close to 98%. Unlike state-of-art classifiers, the biggest advantage of SeLeCT is its ability to help discovering new protocols and applications in an almost automated fashion.
  • Keywords
    Internet; computer network performance evaluation; computer network security; learning (artificial intelligence); pattern clustering; telecommunication network management; telecommunication traffic; ISP; Internet traffic; SeLeCT; adaptive learning approach; clustering phase; filtering phase; label assignment; network management; network visibility; outlier removal; performance evaluation; security; self-learning classifier; traffic engineering; traffic traces; unsupervised algorithms; Accuracy; Algorithm design and analysis; Clustering algorithms; Labeling; Ports (Computers); Protocols; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications Workshops (INFOCOM WKSHPS), 2013 IEEE Conference on
  • Conference_Location
    Turin
  • Print_ISBN
    978-1-4799-0055-8
  • Type

    conf

  • DOI
    10.1109/INFCOMW.2013.6562900
  • Filename
    6562900