Title :
Extraction of link context using tag tree and LALR parsing
Author :
Gupta, Swastik ; Yadav, Suneel
Author_Institution :
AKGEC, Ghaziabad, India
Abstract :
Link context extraction is used to infer the theme of a target web page. The link context supports tasks such as web page categorization and focused crawling. In this paper we propose a method for extracting the link context using a tag tree approach together with a parsing method. The tag tree approach identifies the concept of the anchor text, and this concept is then used by an LALR parser, followed by an algorithm that extracts the link context.
Keywords :
Web sites; grammars; information retrieval; text analysis; trees (mathematics); LALR parsing method; anchor text; link context extraction; tag tree approach; target Web page; Conferences; Context; Crawlers; Grammar; HTML; Web pages; XML; Anchor text; Crawling Indexing; LALR parsing; Link Context; Tag tree;
Conference_Title :
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4673-5759-3
DOI :
10.1109/CICT.2013.6558100