DocumentCode
654146
Title
A popularity-aware method for discovering server IP addresses related to websites
Author
Torres, Luis Miguel ; Magana, Eduardo ; Izal, Mikel ; Morato, Daniel
Author_Institution
Dept. de Autom. y Comput., Univ. Pública de Navarra, Pamplona, Spain
fYear
2013
fDate
28-31 Oct. 2013
Firstpage
1
Lastpage
8
Abstract
The complexity of web traffic has grown in recent years as websites have evolved and new services are provided over HTTP. When a website is accessed, multiple connections to different servers are opened, and it is usually difficult to distinguish which servers are related to which sites. However, this information is useful for security and accounting, and it can also help to label web traffic for use as ground truth in traffic classification systems. In this paper we present a method to discover server IP addresses related to specific websites in a traffic trace. Our method uses NetFlow-type records, which makes it scalable and impervious to encryption of packet payloads. It is, moreover, popularity-aware in the sense that it takes into account the differences in the number of accesses to each site in order to identify servers more accurately. The method can be used to gather data from a group of websites of interest or, when applied to a representative set of websites, to label a sizeable number of connections in a packet trace.
Keywords
IP networks; Web sites; computer network security; cryptography; network servers; transport protocols; HTTP protocol; NetFlow-type records; Web sites; Web traffic; encryption; ground truth; packet payloads; packet trace; popularity-aware method; server IP addresses; traffic classification systems; traffic trace; Browsers; Electronic mail; IP networks; Labeling; Ports (Computers); Servers; Tuning;
fLanguage
English
Publisher
IEEE
Conference_Titel
Global Information Infrastructure Symposium, 2013
Conference_Location
Trento
Type
conf
DOI
10.1109/GIIS.2013.6684350
Filename
6684350