• DocumentCode
    2646112
  • Title

    Visualizing sequential patterns for text mining

  • Author

    Wong, Pak Chung ; Cowley, Wendy ; Foote, Harlan ; Jurrus, Elizabeth ; Thomas, Jim

  • Author_Institution
    Pacific Northwest Lab., Richland, WA, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    105
  • Lastpage
    111
  • Abstract
    A sequential pattern in data mining is a finite series of elements such as A→B→C→D where A, B, C, and D are elements of the same domain. The mining of sequential patterns is designed to find patterns of discrete events that frequently happen in the same arrangement along a timeline. Like association and clustering, the mining of sequential patterns is among the most popular knowledge discovery techniques that apply statistical measures to extract useful information from large datasets. As out computers become more powerful, we are able to mine bigger datasets and obtain hundreds of thousands of sequential patterns in full detail. With this vast amount of data, we argue that neither data mining nor visualization by itself can manage the information and reflect the knowledge effectively. Subsequently, we apply visualization to augment data mining in a study of sequential patterns in large text corpora. The result shows that we can learn more and more quickly in an integrated visual data-mining environment
  • Keywords
    data mining; data visualisation; pattern recognition; very large databases; data mining; knowledge discovery; large datasets; large text corpora; sequential pattern visualization; statistical measures; text mining; Association rules; Data mining; Data visualization; Event detection; Laboratories; Motion pictures; Ores; Read only memory; Statistics; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Visualization, 2000. InfoVis 2000. IEEE Symposium on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1522-404X
  • Print_ISBN
    0-7695-0804-9
  • Type

    conf

  • DOI
    10.1109/INFVIS.2000.885097
  • Filename
    885097