DocumentCode :
2606161
Title :
Analysis of co-occurrence relationship between named entity in Web page
Author :
Lin, Weiyun ; Jiang, Zongli
Author_Institution :
Sch. of Comput. Sci., Beijing Univ. of Technol., Beijing, China
fYear :
2011
fDate :
27-29 June 2011
Firstpage :
1115
Lastpage :
1118
Abstract :
In order to analyze the closeness of named entities in massive web pages, the word co-occurrence algorithm FDC(frequency, term distance, co-collection ratio) is employed to evaluate the co-occurrence relationships between the named entities by their co-occurrence frequency, relative position and the ratio of co-occurrence among a document. And by employing the proper value of named entities´ co-occurrence frequency and the relative distances between the two named entities, the FDC algorithm is improved. Experiments show that the improved FDC algorithm has better performance.
Keywords :
Web sites; document handling; FDC algorithm; Web page; cooccurrence relationship; document cooccurrence; named entity; Algorithm design and analysis; Continuous wavelet transforms; Data mining; Educational institutions; HTML; Web pages; FDC algorithm; co-occurrence; massive information; named entity; relevance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Service System (CSSS), 2011 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-9762-1
Type :
conf
DOI :
10.1109/CSSS.2011.5973938
Filename :
5973938
Link To Document :
بازگشت