Title :
Mondou: interface with text data mining for Web search engine
Author :
Kawano, Hiroyuki ; Hasegawa, Toshiharu
Author_Institution :
Dept. of Appl. Syst. Sci., Kyoto Univ., Japan
Abstract :
To submit queries to Web search engines, users have to choose a suitable combination of keywords carefully. Without rich background knowledge about the keywords in Web documents, it is difficult to find valuable URLs with search engines. Applying text data mining techniques to Web resource discovery, we try to derive associative keywords with an extended association algorithm. We explain the interface of the Web resource discovery system, which uses association rules derived from a cluster of Japanese HTML pages in the text database. We discuss an evaluation of the Mondou system and a Java applet for visualizing search results with multidimensional measurements.
Keywords :
Internet; deductive databases; information retrieval; knowledge acquisition; object-oriented languages; online front-ends; page description languages; user interfaces; HTML; Java applet; Mondou system; URL; Web search engine; World Wide Web documents; association rules; extended association algorithm; keywords; multidimensional measurements; queries; resource discovery; search results; text data mining; text database; user interface; Association rules; Clustering algorithms; Data mining; Java; Search engines; Uniform resource locators; Visual databases; Visualization; Web search
Conference_Title :
Proceedings of the Thirty-First Hawaii International Conference on System Sciences, 1998
Conference_Location :
Kohala Coast, HI
Print_ISBN :
0-8186-8255-8
DOI :
10.1109/HICSS.1998.648322