DocumentCode
120798
Title
A novel approach for link context extraction using Bison parser
Author
Gupta, Swastik ; Yadav, Suneel
Author_Institution
AKGEC, Ghaziabad, India
fYear
2014
fDate
21-22 Feb. 2014
Firstpage
941
Lastpage
945
Abstract
With the advent of World Wide Web, link context has been widely used for finding the theme of the target web page. Many approaches have been used to take advantage of the link context to get the precise context of link but the approaches were not very efficient. Link Context has been used in many areas like classification of web page, search engines, topical crawlers. In this paper we have derived the link context using LALR parser (Bison parser). For this different web pages have been collected and with the help of tag tree concepts are found out. Then using Bison parser link context have been derived. We have also compared the technique with the anchor text based method using Jaccard coefficient.
Keywords
Internet; classification; context-free grammars; indexing; information retrieval; search engines; text analysis; Bison parser; Jaccard coefficient; LALR parser; Web page classification; World Wide Web; anchor text based method; link context extraction; search engine classification; topical crawler classification; Conferences; Context; Crawlers; Flexible printed circuits; Grammar; HTML; Web pages; Anchor text; Crawling; Indexing; LALR parsing; Link Context; Tag tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location
Gurgaon
Print_ISBN
978-1-4799-2571-1
Type
conf
DOI
10.1109/IAdCC.2014.6779449
Filename
6779449
Link To Document