• DocumentCode
    3470815
  • Title

    A fuzzy set approach to extracting keywords from abstracts

  • Author

    Makrehchi, Masoud ; Kame, Mohamed

  • Author_Institution
    Dept. of Syst. Design Eng., Waterloo Univ., Ont., Canada
  • Volume
    2
  • fYear
    2004
  • fDate
    27-30 June 2004
  • Firstpage
    528
  • Abstract
    An unsupervised keyword extraction method based on corpora is proposed. After representing each document in the collection by a fuzzy set of candidate keywords, the problem is translated into finding appropriate fuzzy membership degree for each candidate. In order to determine the membership degrees, first, all terms of the vocabulary are mapped into a two-dimensional space called class-collection map by obtaining two newly proposed fuzzy measures called fuzzy significance and fuzzy relevance for each term. At the second step, the mapped terms are grouped into three categories, namely, features, keywords and stopwords that are discriminated by their contribution to the meaning of the documents. Instead of a clustering approach for grouping purpose, a fuzzy rule base is provided. The method is independent of the language and data dimensionality. It does not require the use of dictionary, thesaurus nor natural language processing.
  • Keywords
    fuzzy set theory; information retrieval; vocabulary; word processing; abstracts; fuzzy rule; fuzzy set; keywords extraction; vocabulary; Abstracts; Design engineering; Fuzzy sets; Fuzzy systems; Machine intelligence; Pattern analysis; System analysis and design; Text mining; Vocabulary; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
  • Print_ISBN
    0-7803-8376-1
  • Type

    conf

  • DOI
    10.1109/NAFIPS.2004.1337356
  • Filename
    1337356