Title :
Interactive data analysis: the Control project
Author :
Hellerstein, Joseph M. ; Avnur, Ron ; Chou, Andy ; Hidber, Christian ; Olston, Chris ; Raman, Vijayshankar ; Roth, Tali ; Haas, Peter J.
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
fDate :
8/1/1999 12:00:00 AM
Abstract :
Data analysis is fundamentally an iterative process in which you issue a query, receive a response, formulate the next query based on the response, and repeat. You usually don´t issue a single, perfectly chosen query and get the information you want from a database; indeed, the purpose of data analysis is to extract unknown information, and in most situations, there is no one perfect query. People naturally start by asking broad, big-picture questions and then continually refine their questions based on feedback and domain knowledge. In the Control (Continuous Output and Navigation Technology with Refinement Online) project at the University of California, Berkeley, the authors are working with collaborators at IBM, Informix, and elsewhere to explore ways to improve human-computer interaction during data analysis. The Control project´s goal is to develop interactive, intuitive techniques for analyzing massive data sets
Keywords :
data analysis; data mining; very large databases; Continuous Output and Navigation Technology with Refinement Online project; database; domain knowledge; feedback; human-computer interaction; interactive data analysis; intuitive techniques; iterative process; massive data set analysis; query; unknown information extraction; Algorithm design and analysis; Association rules; Clustering algorithms; Control systems; Data analysis; Data mining; Data visualization; Database languages; Database systems; Feedback;