DocumentCode
687572
Title
What are you Googling? - Inferring search type information through a statistical classifier
Author
Iacovazzi, Alfonso ; Baiocchi, Andrea ; Bettini, Luca
Author_Institution
DIET - Dept. of Inf. Eng., Sapienza Univ. of Rome, Rome, Italy
fYear
2013
fDate
9-13 Dec. 2013
Firstpage
747
Lastpage
753
Abstract
Privacy in communications calls primarily for encryption of information flows. Privacy breaches of packet traffic flows have been widely demonstrated in point-to-point communications, owing to information leakage from observable traffic features such as packet length, timestamp, and direction. We address a point-to-multipoint system, namely a Content Delivery Network, where user clients maintain and use connections with a number of servers. Specifically, we address Google search services: they are conveyed over TLS connections, using HTTPS, either from within user accounts or without logging in as a Google services user. HTTPS is provided to protect communications privacy. Yet we show that, by collecting the encrypted traffic and extracting simple features related to traffic activity and possibly the amount of data sent by servers to clients, effective classifiers of user activity can be realized. Specifically, we are able to distinguish which type of search a user is carrying out, among a given set of alternatives (text, images, maps, video, video on YouTube, news), with average success rates that can exceed 90%.
Keywords
cryptography; data privacy; information services; search engines; telecommunication traffic; transport protocols; Google search services; HTTPS; TLS connections; communications privacy; content delivery network; information flow encryption; information leakage; point-to-multipoint system; point-to-point communications; search type information; statistical classifier; traffic activity; Accuracy; Feature extraction; Google; IP networks; Privacy; Servers; YouTube; packet feature analysis; side-channel information leaks
fLanguage
English
Publisher
IEEE
Conference_Titel
Global Communications Conference (GLOBECOM), 2013 IEEE
Conference_Location
Atlanta, GA
Type
conf
DOI
10.1109/GLOCOM.2013.6831162
Filename
6831162