• DocumentCode
    623962
  • Title

    Self-learning classifier for Internet traffic

  • Author

    Grimaudo, Luigi ; Mellia, Marco ; Baralis, Elena ; Keralapura, Ram

  • Author_Institution
    Politec. di Torino, Turin, Italy
  • fYear
    2013
  • fDate
    14-19 April 2013
  • Firstpage
    3381
  • Lastpage
    3386
  • Abstract
    Network visibility is a critical part of traffic engineering, network management, and security. Recently, unsupervised algorithms have been envisioned as a viable alternative to automatically identify classes of traffic. However, the accuracy achieved so far does not allow to use them for traffic classification in practical scenario. In this paper, we propose SeLeCT, a Self-Learning Classifier for Internet traffic. It uses unsupervised algorithms along with an adaptive learning approach to automatically let classes of traffic emerge, being identified and (easily) labeled. SeLeCT automatically groups flows into pure (or homogeneous) clusters using alternating simple clustering and filtering phases to remove outliers. SeLeCT uses an adaptive learning approach to boost its ability to spot new protocols and applications. Finally, SeLeCT also simplifies label assignment (which is still based on some manual intervention) so that proper class labels can be easily discovered. We evaluate the performance of SeLeCT using traffic traces collected in different years from various ISPs located in 3 different continents. Our experiments show that SeLeCT achieves overall accuracy close to 98%. Unlike state-of-art classifiers, the biggest advantage of SeLeCT is its ability to help discovering new protocols and applications in an almost automated fashion.
  • Keywords
    Internet; information filtering; pattern classification; pattern clustering; protocols; telecommunication traffic; unsupervised learning; ISP; Internet service provider; Internet traffic; SeLeCT; adaptive learning approach; label assignment; network management; network security; network visibility; outlier removal; protocols; self-learning classifier; traffic classification; traffic engineering; traffic traces; unsupervised learning algorithms; Accuracy; Algorithm design and analysis; Clustering algorithms; Labeling; Ports (Computers); Protocols; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2013 Proceedings IEEE
  • Conference_Location
    Turin
  • ISSN
    0743-166X
  • Print_ISBN
    978-1-4673-5944-3
  • Type

    conf

  • DOI
    10.1109/INFCOM.2013.6567168
  • Filename
    6567168