DocumentCode
618289
Title
Extraction of link context using tag tree and LALR parsing
Author
Gupta, Swastik ; Yadav, Suneel
Author_Institution
AKGEC, Ghaziabad, India
fYear
2013
fDate
11-12 April 2013
Firstpage
253
Lastpage
257
Abstract
Extraction of link context is used to know the theme of the target web page. This Link Context is used in many tasks like categorization of the web page, focused crawling. In this paper we have proposed a method to extract the link context with the help of tag tree approach and parsing method. Tag tree approach will help to find the concept of the anchor text and this concept will be used by LALR parser followed by the algorithm for extraction of link context.
Keywords
Web sites; grammars; information retrieval; text analysis; trees (mathematics); LALR parsing method; anchor text; link context extraction; tag tree approach; target Web page; Conferences; Context; Crawlers; Grammar; HTML; Web pages; XML; Anchor text; Crawling Indexing; LALR parsing; Link Context; Tag tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location
JeJu Island
Print_ISBN
978-1-4673-5759-3
Type
conf
DOI
10.1109/CICT.2013.6558100
Filename
6558100
Link To Document