Title :
Visual analysis of massive web session data
Author :
Shen, Zeqian ; Wei, Jishang ; Sundaresan, Neel ; Ma, Kwan-Liu
Abstract :
Tracking and recording users´ browsing behaviors on the web down to individual mouse clicks can create massive web session logs. While such web session data contains valuable information about user behaviors, the ever-increasing data size has placed a big challenge to analyzing and visualizing the data. An efficient data analysis framework requires both powerful computational analysis and interactive visualization. Following the visual analytics mantra “Analyze first, show the important, zoom, filter and analyze further, details on demand”, we introduce a two-tier visual analysis system, TrailExplorer2, to discover knowledge from massive log data. The system supports a visual analysis process iterating between two steps: querying web sessions and visually analyzing the retrieved data. The query happens at the lower tier where terabytes of web session data are processed in a cluster. At the upper tier, the extracted web sessions with much smaller scale are visualized on a personal computer for interactive exploration. Our system visualizes a sorted list of web sessions´ temporal patterns and enables data exploration at different levels of details. The query-visualization-exploration process iterates until a satisfactory conclusion is achieved. We present two case studies of TrailExplorer2 using real world session data from eBay to demonstrate the system´s effectiveness.
Keywords :
Internet; behavioural sciences; data analysis; data mining; data visualisation; human computer interaction; interactive systems; query processing; TrailExplorer2; Web session querying; computational analysis; data exploration; data retrieval; data size; data visualization; eBay; interactive exploration; interactive visualization; knowledge discovery; log data; massive Web session data analysis; query-visualization-exploration process; temporal patterns; two-tier visual analysis system; user browsing behavior recording; user browsing behavior tracking; Correlation; Data analysis; Data mining; Data visualization; Vegetation; Visual analytics; H.5.m [Information Interfaces and presentation (e.g., HCI)]: Miscellaneous;
Conference_Titel :
Large Data Analysis and Visualization (LDAV), 2012 IEEE Symposium on
Conference_Location :
Seattle, WA
Print_ISBN :
978-1-4673-4732-7
DOI :
10.1109/LDAV.2012.6378977