Title :
Descriptive words for small Web collections
Author :
Deepak, P. ; John, Jyothi
Author_Institution :
Model Eng. Coll., Kochi, India
Abstract :
This paper addresses the problem of identifying the subject of small, sparsely linked collections of Web documents (Web communities). Attempts to solve many Web-related problems often leave us with a handful of pages that deal with something in common but have very few links among them. This paper presents algorithms that work on such collections and output a set of words descriptive of the collection, in decreasing order of relevance. The set of most relevant words, which can aptly be called the "subject set", closely approximates the topic that the collection deals with. The subject set of the first few results from a Web search could be used to refine that search further. It could also greatly simplify the Web search process by being used to index Web communities. Finally, it could be used in parental monitoring systems, where the subject set of the pages browsed by a child could indicate the intent of the child's Web usage.
Keywords :
Internet; information retrieval; search engines; text analysis; Web collection; Web document; Web search; descriptive word; indexing; page browsing; parental monitoring system; Data mining; Educational institutions; Frequency; Indexing; Monitoring; Testing; Text analysis; Web pages; Web search; Web services;
Conference_Title :
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN :
0-7803-8482-2
DOI :
10.1109/ICTTA.2004.1307924