DocumentCode
3470815
Title
A fuzzy set approach to extracting keywords from abstracts
Author
Makrehchi, Masoud ; Kame, Mohamed
Author_Institution
Dept. of Syst. Design Eng., Waterloo Univ., Ont., Canada
Volume
2
fYear
2004
fDate
27-30 June 2004
Firstpage
528
Abstract
An unsupervised keyword extraction method based on corpora is proposed. After representing each document in the collection by a fuzzy set of candidate keywords, the problem is translated into finding appropriate fuzzy membership degree for each candidate. In order to determine the membership degrees, first, all terms of the vocabulary are mapped into a two-dimensional space called class-collection map by obtaining two newly proposed fuzzy measures called fuzzy significance and fuzzy relevance for each term. At the second step, the mapped terms are grouped into three categories, namely, features, keywords and stopwords that are discriminated by their contribution to the meaning of the documents. Instead of a clustering approach for grouping purpose, a fuzzy rule base is provided. The method is independent of the language and data dimensionality. It does not require the use of dictionary, thesaurus nor natural language processing.
Keywords
fuzzy set theory; information retrieval; vocabulary; word processing; abstracts; fuzzy rule; fuzzy set; keywords extraction; vocabulary; Abstracts; Design engineering; Fuzzy sets; Fuzzy systems; Machine intelligence; Pattern analysis; System analysis and design; Text mining; Vocabulary; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
Print_ISBN
0-7803-8376-1
Type
conf
DOI
10.1109/NAFIPS.2004.1337356
Filename
1337356
Link To Document