DocumentCode :
3275408
Title :
Reconstruction of web forms for efficient web search
Author :
Mittal, Namita ; Govil, MC ; Nayak, Richi ; Jain, Neeraj
Author_Institution :
Dept. of Comput. Eng., MNIT, Jaipur, India
fYear :
2009
fDate :
14-15 Dec. 2009
Firstpage :
1
Lastpage :
5
Abstract :
Websites, notable by URLs are large collection of Web pages. They make a huge database of heterogeneous information gathered and collected distributive. The accumulated information is differentiated on the basis of certain templates, their URLs and information contained in these pages. In this research, we mainly concentrate on Web-forums. In the current circumstances, a Web crawler crawls all the Page URLs representing the Web forums, out of which some of them redirects to invalid pages, some to the pages themselves and some pages that contain classified information requires authentication to access redirect to login pages. In this paper, we aim to reconstruct the Web forums by automatically removing all those Page URLs, facing such problems. The proposed approach provides an ease to Web crawlers and make the search efficient and effective.
Keywords :
Web sites; information retrieval; URL; Web crawler; Web forms; Web search; Web sites; Web-forums; authentication; database; heterogeneous information; information classification; login pages; Australia; Authentication; Computer science; Crawlers; Databases; Information technology; Joining processes; Uniform resource locators; Web pages; Web search; Clustering; DOM tree; Template; Web Forums;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Methods and Models in Computer Science, 2009. ICM2CS 2009. Proceeding of International Conference on
Conference_Location :
Delhi
Print_ISBN :
978-1-4244-5051-0
Type :
conf
DOI :
10.1109/ICM2CS.2009.5397957
Filename :
5397957
Link To Document :
بازگشت