• DocumentCode
    524395
  • Title

    Keyphrases extraction research based on structure of document

  • Author

    Huang, Huan ; Wang, Hong

  • Author_Institution
    Nat. Eng. Res. Center for E-Learning, HuaZhong Normal Univ., Wuhan, China
  • Volume
    3
  • fYear
    2010
  • fDate
    22-24 June 2010
  • Abstract
    Keyphrase is the foundation of text categorization, automatic summary and information retrieval, so the research of automatic keyphrase extraction has important significance. The current keyphrase extraction methods don´t take full advantage of the structural features of the document, too much emphasis on the importance of term frequency, which result in the low accuracy of keyphrase extraction. According to these, the paper proposed a keyphrase extraction method based on structure features of the document. It combined with term frequency, location and length information to automatically extract keyphrases.
  • Keywords
    category theory; information retrieval; text analysis; automatic summary; document structure; information retrieval; keyphrases extraction research; text categorization; Computer science education; Data mining; Educational technology; Electronic learning; Frequency; Information retrieval; Internet; Machine learning algorithms; Text categorization; Thesauri; frequency factor; keyphrases extracton; location factor; term weight;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Education Technology and Computer (ICETC), 2010 2nd International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-6367-1
  • Type

    conf

  • DOI
    10.1109/ICETC.2010.5529567
  • Filename
    5529567