DocumentCode :
2555510
Title :
TOPIC ISLANDSTM-a wavelet-based text visualization system
Author :
Miller, Nancy E. ; Wong, Pak Chung ; Brewster, Mary ; Foote, Harlan
Author_Institution :
Pacific Northwest Lab., Richland, WA, USA
fYear :
1998
fDate :
24-24 Oct. 1998
Firstpage :
189
Lastpage :
196
Abstract :
We present a novel approach to visualize and explore unstructured text. The underlying technology, called TOPIC-O-GRAPHYTM, applies wavelet transforms to a custom digital signal constructed from words within a document. The resultant multiresolution wavelet energy is used to analyze the characteristics of the narrative flow in the frequency domain, such as theme changes, which is then related to the overall thematic content of the text document using statistical methods. The thematic characteristics of a document can be analyzed at varying degrees of detail, ranging from section-sized text partitions to partitions consisting of a few words. Using this technology, we are developing a visualization system prototype known as TOPIC ISLANDS to browse a document, generate fuzzy document outlines, summarize text by levels of detail and according to user interests, define meaningful subdocuments, query text content, and provide summaries of topic evolution.
Keywords :
data visualisation; document handling; statistical analysis; wavelet transforms; TOPIC ISLANDS; TOPIC-O-GRAPHY; document; fuzzy document outlines; multiresolution wavelet energy; query processing; statistical methods; text visualization system; unstructured text; user interests; wavelet transforms; Energy resolution; Frequency domain analysis; Fuzzy systems; Prototypes; Signal resolution; Statistical analysis; Visualization; Wavelet analysis; Wavelet domain; Wavelet transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Visualization '98. Proceedings
Conference_Location :
Research Triangle Park, NC, USA
ISSN :
1070-2385
Print_ISBN :
0-8186-9176-X
Type :
conf
DOI :
10.1109/VISUAL.1998.745302
Filename :
745302
Link To Document :
بازگشت