DocumentCode
112733
Title
ThemeDelta: Dynamic Segmentations over Temporal Topic Models
Author
Gad, Samah ; Javed, Waqas ; Ghani, Sohaib ; Elmqvist, Niklas ; Ewing, Tom ; Hampton, Keith N. ; Ramakrishnan, Naren
Author_Institution
Virginia Tech, Blacksburg, VA, USA
Volume
21
Issue
5
fYear
2015
fDate
May 1 2015
Firstpage
672
Lastpage
685
Abstract
We present ThemeDelta, a visual analytics system for extracting and visualizing temporal trends, clustering, and reorganization in time-indexed textual datasets. ThemeDelta is supported by a dynamic temporal segmentation algorithm that integrates with topic modeling algorithms to identify change points where significant shifts in topics occur. This algorithm detects not only the clustering and associations of keywords in a time period, but also their convergence into topics (groups of keywords) that may later diverge into new groups. The visual representation of ThemeDelta uses sinuous, variable-width lines to show this evolution on a timeline, utilizing color for categories, and line width for keyword strength. We demonstrate how interaction with ThemeDelta helps capture the rise and fall of topics by analyzing archives of historical newspapers, of U.S. presidential campaign speeches, and of social messages collected through iNeighbors, a web-based social website. ThemeDelta is evaluated using a qualitative expert user study involving three researchers from rhetoric and history using the historical newspapers corpus.
Keywords
Web sites; data analysis; text analysis; ThemeDelta; US presidential campaign speeches; Web-based social Website; dynamic temporal segmentation algorithm; historical newspapers; iNeighbors; keyword strength; temporal topic models; time-indexed textual datasets; topic modeling algorithms; visual analytics system; Data visualization; Heuristic algorithms; Layout; Market research; Tag clouds; Visual analytics; Language models; text analytics; time-series segmentation; visual representations;
fLanguage
English
Journal_Title
Visualization and Computer Graphics, IEEE Transactions on
Publisher
ieee
ISSN
1077-2626
Type
jour
DOI
10.1109/TVCG.2014.2388208
Filename
7001093
Link To Document