DocumentCode :
524395
Title :
Keyphrases extraction research based on structure of document
Author :
Huang, Huan ; Wang, Hong
Author_Institution :
Nat. Eng. Res. Center for E-Learning, HuaZhong Normal Univ., Wuhan, China
Volume :
3
fYear :
2010
fDate :
22-24 June 2010
Abstract :
Keyphrase is the foundation of text categorization, automatic summary and information retrieval, so the research of automatic keyphrase extraction has important significance. The current keyphrase extraction methods don´t take full advantage of the structural features of the document, too much emphasis on the importance of term frequency, which result in the low accuracy of keyphrase extraction. According to these, the paper proposed a keyphrase extraction method based on structure features of the document. It combined with term frequency, location and length information to automatically extract keyphrases.
Keywords :
category theory; information retrieval; text analysis; automatic summary; document structure; information retrieval; keyphrases extraction research; text categorization; Computer science education; Data mining; Educational technology; Electronic learning; Frequency; Information retrieval; Internet; Machine learning algorithms; Text categorization; Thesauri; frequency factor; keyphrases extracton; location factor; term weight;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
Type :
conf
DOI :
10.1109/ICETC.2010.5529567
Filename :
5529567
Link To Document :
بازگشت