DocumentCode
524395
Title
Keyphrases extraction research based on structure of document
Author
Huang, Huan ; Wang, Hong
Author_Institution
Nat. Eng. Res. Center for E-Learning, HuaZhong Normal Univ., Wuhan, China
Volume
3
fYear
2010
fDate
22-24 June 2010
Abstract
Keyphrase is the foundation of text categorization, automatic summary and information retrieval, so the research of automatic keyphrase extraction has important significance. The current keyphrase extraction methods don´t take full advantage of the structural features of the document, too much emphasis on the importance of term frequency, which result in the low accuracy of keyphrase extraction. According to these, the paper proposed a keyphrase extraction method based on structure features of the document. It combined with term frequency, location and length information to automatically extract keyphrases.
Keywords
category theory; information retrieval; text analysis; automatic summary; document structure; information retrieval; keyphrases extraction research; text categorization; Computer science education; Data mining; Educational technology; Electronic learning; Frequency; Information retrieval; Internet; Machine learning algorithms; Text categorization; Thesauri; frequency factor; keyphrases extracton; location factor; term weight;
fLanguage
English
Publisher
ieee
Conference_Titel
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-6367-1
Type
conf
DOI
10.1109/ICETC.2010.5529567
Filename
5529567
Link To Document