Title :
Outskewer: Using Skewness to Spot Outliers in Samples and Time Series
Author :
Heymann, Sebastien ; Latapy, Matthieu ; Magnien, C.
Author_Institution :
LIP6, Univ. Pierre et Marie Curie, Paris, France
Abstract :
Finding outliers in datasets is a classical problem of high interest for (dynamic) social network analysis. However, most methods rely on assumptions which are rarely met in practice, such as prior knowledge of some outliers or about normal behavior. We propose here Out skewer, a new approach based on the notion of skewness (a measure of the symmetry of a distribution) and its evolution when extremal values are removed one by one. Our method is easy to set up, it requires no prior knowledge on the system, and it may be used on-line. We illustrate its performance on two data sets representative of many use-cases: evolution of ego-centered views of the internet topology, and logs of queries entered into a search engine.
Keywords :
Internet; query processing; search engines; social networking (online); time series; topology; Internet topology; normal behavior; outlier detection; outskewer; search engine; skewness; social network analysis; time series; Data models; Distributed databases; Gaussian distribution; Market research; Reliability; Social network services; Time series analysis; Internet topology; anomaly detection; complex networks; outlier; peer-to-peer; skewness; time series;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-2497-7
DOI :
10.1109/ASONAM.2012.91