Title :
Visualizing Statistical Mix Effects and Simpson's Paradox
Author :
Armstrong, Zan ; Wattenberg, Martin
Abstract :
We discuss how “mix effects” can surprise users of visualizations and potentially lead them to incorrect conclusions. This statistical issue (also known as “omitted variable bias” or, in extreme cases, as “Simpson´s paradox”) is widespread and can affect any visualization in which the quantity of interest is an aggregated value such as a weighted sum or average. Our first contribution is to document how mix effects can be a serious issue for visualizations, and we analyze how mix effects can cause problems in a variety of popular visualization techniques, from bar charts to treemaps. Our second contribution is a new technique, the “comet chart,” that is meant to ameliorate some of these issues.
Keywords :
data visualisation; statistical analysis; Simpson paradox; average value; bar charts; comet chart technique; data visualization; omitted variable bias; statistical mix effects; treemaps; visualization techniques; weighted sum value; Data visualization; Image color analysis; Image segmentation; Statistics; Mix effects; Omitted variable bias; Simpson´s paradox; Statistics;
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
DOI :
10.1109/TVCG.2014.2346297