DocumentCode :
3103314
Title :
Descriptive words for small Web collections
Author :
Deepak, P. ; John, Jyothi
Author_Institution :
Model Eng. Coll., Kochi, India
fYear :
2004
fDate :
19-23 April 2004
Firstpage :
633
Lastpage :
634
Abstract :
This paper deals with the problem of identifying the subject of small sparsely linked collections of Web documents (Web community). In the course of attempts to find solutions for many problems concerning the Web, we are often left with a handful of pages dealing with something in common, but with very few links within them. This paper presents algorithms, which work on such collections and output a set of descriptive words, descriptive of the collection, ordered in the decreasing order of relevance. The set of most relevant words, which can be aptly called the "subject set", provides a close approximation of the topic that the collection deals with. The subject set of the first few results from a Web search could be used to further refine Web search. It could greatly simplify the Web search process by indexing web communities. It could well be used for parental monitoring systems, where the subject set of the collection of pages browsed by the child could point out intentions of the Web usage by the child.
Keywords :
Internet; information retrieval; search engines; text analysis; Web collection; Web document; Web search; descriptive word; indexing; page browsing; parental monitoring system; Data mining; Educational institutions; Frequency; Indexing; Monitoring; Testing; Text analysis; Web pages; Web search; Web services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN :
0-7803-8482-2
Type :
conf
DOI :
10.1109/ICTTA.2004.1307924
Filename :
1307924
Link To Document :
بازگشت