• DocumentCode
    2606161
  • Title

    Analysis of co-occurrence relationship between named entity in Web page

  • Author

    Lin, Weiyun ; Jiang, Zongli

  • Author_Institution
    Sch. of Comput. Sci., Beijing Univ. of Technol., Beijing, China
  • fYear
    2011
  • fDate
    27-29 June 2011
  • Firstpage
    1115
  • Lastpage
    1118
  • Abstract
    In order to analyze the closeness of named entities in massive web pages, the word co-occurrence algorithm FDC(frequency, term distance, co-collection ratio) is employed to evaluate the co-occurrence relationships between the named entities by their co-occurrence frequency, relative position and the ratio of co-occurrence among a document. And by employing the proper value of named entities´ co-occurrence frequency and the relative distances between the two named entities, the FDC algorithm is improved. Experiments show that the improved FDC algorithm has better performance.
  • Keywords
    Web sites; document handling; FDC algorithm; Web page; cooccurrence relationship; document cooccurrence; named entity; Algorithm design and analysis; Continuous wavelet transforms; Data mining; Educational institutions; HTML; Web pages; FDC algorithm; co-occurrence; massive information; named entity; relevance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Service System (CSSS), 2011 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-9762-1
  • Type

    conf

  • DOI
    10.1109/CSSS.2011.5973938
  • Filename
    5973938