• DocumentCode
    2813167
  • Title

    Extracting baseline patterns in Internet traffic using Robust Principal Components

  • Author

    Bandara, Vidarshana W. ; Jayasumana, Anura P.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Colorado State Univ., Fort Collins, CO, USA
  • fYear
    2011
  • fDate
    4-7 Oct. 2011
  • Firstpage
    407
  • Lastpage
    415
  • Abstract
    Robust BaseLine (RBL) is a formal technique for extracting the baseline of network traffic to capture the underlying traffic trend. A range of applications such as anomaly detection and load balancing rely on baseline estimation. Once the fundamental period of the pattern for analysis is recognized, e.g., based on user interest or a period detector such as Autocorrelation Function (ACF), the basic extraction is carried out in two steps. First, the common component across the dataset is separated using Robust Principal Component Analysis (RPCA). The fundamental pattern in the common component is extracted using Principal Component Analysis (PCA) in the second step. Scaling factors required to fit the base-pattern back into the data are returned automatically by PCA. Two types of traffic baselines may be extracted: RBL-L captures the common behavior across time on a single link, and RBL-N captures the common behavior across a network of links, i.e., in space. RBL-N is particularly useful for specifying traffic matrices more efficiently over time, which normally requires multiple updates to follow baseline trends. The derived base-patterns for a single link or a single time period is then extended over the entire network or thru the entire observation period with a compressive analysis. The compressed base-pattern provides a smoother baseline and also a filter to separate baseline traffic and the deviations on the fly from traffic measurements. When compared against BLGBA (Baseline for Automatic Backbone Management) the proposed scheme provides a less noisy, more precisely fitting baseline. It is also more effective in revealing anomalies.
  • Keywords
    Internet; principal component analysis; telecommunication traffic recording; baseline patterns; compressed base pattern; internet traffic; robust baseline; robust principal component analysis; traffic baselines; traffic measurement; Data mining; Hidden Markov models; Internet; Principal component analysis; Robustness; Sparse matrices; Time frequency analysis; Anomaly detection; Baselining; Internet Traffic; Load balancing; Traffic characterization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Local Computer Networks (LCN), 2011 IEEE 36th Conference on
  • Conference_Location
    Bonn
  • ISSN
    0742-1303
  • Print_ISBN
    978-1-61284-926-3
  • Type

    conf

  • DOI
    10.1109/LCN.2011.6115501
  • Filename
    6115501