• DocumentCode
    618289
  • Title

    Extraction of link context using tag tree and LALR parsing

  • Author

    Gupta, Swastik ; Yadav, Suneel

  • Author_Institution
    AKGEC, Ghaziabad, India
  • fYear
    2013
  • fDate
    11-12 April 2013
  • Firstpage
    253
  • Lastpage
    257
  • Abstract
    Link context extraction is used to identify the theme of a target web page. The link context supports tasks such as web page categorization and focused crawling. In this paper we propose a method that extracts the link context using a tag tree approach and a parsing method. The tag tree approach helps find the concept of the anchor text; this concept is then used by an LALR parser, followed by the algorithm for extracting the link context.
  • Keywords
    Web sites; grammars; information retrieval; text analysis; trees (mathematics); LALR parsing method; anchor text; link context extraction; tag tree approach; target Web page; Conferences; Context; Crawlers; Grammar; HTML; Web pages; XML; Anchor text; Crawling Indexing; LALR parsing; Link Context; Tag tree;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information & Communication Technologies (ICT), 2013 IEEE Conference on
  • Conference_Location
    JeJu Island
  • Print_ISBN
    978-1-4673-5759-3
  • Type

    conf

  • DOI
    10.1109/CICT.2013.6558100
  • Filename
    6558100