• DocumentCode
    2512503
  • Title

    Atypical behavior identification in large-scale network traffic

  • Author

    Best, Daniel M. ; Hafen, Ryan P. ; Olsen, Bryan K. ; Pike, W.A.

  • Author_Institution
    Pacific Northwest Nat. Lab., Richland, WA, USA
  • fYear
    2011
  • fDate
    23-24 Oct. 2011
  • Firstpage
    15
  • Lastpage
    22
  • Abstract
    Cyber analysts are faced with the daunting challenge of identifying exploits and threats within potentially billions of daily records of network traffic. Enterprise-wide cyber traffic involves hundreds of millions of distinct IP addresses and results in data sets ranging from terabytes to petabytes of raw data. Creating behavioral models and identifying trends based on those models requires data intensive architectures and techniques that can scale as data volume increases. Analysts need scalable visualization methods that foster interactive exploration of data and enable identification of behavioral anomalies. Developers must carefully consider application design, storage, processing, and display to provide usability and interactivity with large-scale data. We present an application that highlights atypical behavior in enterprise network flow records. This is accomplished by utilizing data intensive architectures to store the data, aggregation techniques to optimize data access, statistical techniques to characterize behavior, and a visual analytic environment to render the behavioral trends, highlight atypical activity, and allow for exploration.
  • Keywords
    IP networks; business communication; computer network security; data visualisation; set theory; statistical analysis; telecommunication traffic; IP address; behavior identification; behavioral anomaly; behavioral model; cyber analysts; data access optimization; data aggregation technique; data intensive architecture; data sets; data volume; enterprise network flow record; enterprise-wide cyber traffic; foster interactive exploration; large scale network traffic; scalable visualization method; statistical technique; visual analytic environment; Analytical models; Data models; Data visualization; Databases; IP networks; Measurement; Visual analytics; Time series; cyber analytics; large-scale data; visual analytics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Large Data Analysis and Visualization (LDAV), 2011 IEEE Symposium on
  • Conference_Location
    Providence, Rl
  • Print_ISBN
    978-1-4673-0156-5
  • Type

    conf

  • DOI
    10.1109/LDAV.2011.6092312
  • Filename
    6092312