DocumentCode :
618289
Title :
Extraction of link context using tag tree and LALR parsing
Author :
Gupta, Swastik ; Yadav, Suneel
Author_Institution :
AKGEC, Ghaziabad, India
fYear :
2013
fDate :
11-12 April 2013
Firstpage :
253
Lastpage :
257
Abstract :
Link context extraction is used to infer the theme of a target web page. The link context supports tasks such as web page categorization and focused crawling. In this paper we propose a method that extracts link context using a tag tree approach together with LALR parsing. The tag tree approach identifies the concept of the anchor text; this concept is then used by an LALR parser, followed by an algorithm that extracts the link context.
Keywords :
Web sites; grammars; information retrieval; text analysis; trees (mathematics); LALR parsing method; anchor text; link context extraction; tag tree approach; target Web page; Conferences; Context; Crawlers; Grammar; HTML; Web pages; XML; Anchor text; Crawling Indexing; LALR parsing; Link Context; Tag tree;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location :
JeJu Island
Print_ISBN :
978-1-4673-5759-3
Type :
conf
DOI :
10.1109/CICT.2013.6558100
Filename :
6558100