DocumentCode :
3470815
Title :
A fuzzy set approach to extracting keywords from abstracts
Author :
Makrehchi, Masoud ; Kame, Mohamed
Author_Institution :
Dept. of Syst. Design Eng., Waterloo Univ., Ont., Canada
Volume :
2
fYear :
2004
fDate :
27-30 June 2004
Firstpage :
528
Abstract :
An unsupervised keyword extraction method based on corpora is proposed. After representing each document in the collection by a fuzzy set of candidate keywords, the problem is translated into finding appropriate fuzzy membership degree for each candidate. In order to determine the membership degrees, first, all terms of the vocabulary are mapped into a two-dimensional space called class-collection map by obtaining two newly proposed fuzzy measures called fuzzy significance and fuzzy relevance for each term. At the second step, the mapped terms are grouped into three categories, namely, features, keywords and stopwords that are discriminated by their contribution to the meaning of the documents. Instead of a clustering approach for grouping purpose, a fuzzy rule base is provided. The method is independent of the language and data dimensionality. It does not require the use of dictionary, thesaurus nor natural language processing.
Keywords :
fuzzy set theory; information retrieval; vocabulary; word processing; abstracts; fuzzy rule; fuzzy set; keywords extraction; vocabulary; Abstracts; Design engineering; Fuzzy sets; Fuzzy systems; Machine intelligence; Pattern analysis; System analysis and design; Text mining; Vocabulary; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
Print_ISBN :
0-7803-8376-1
Type :
conf
DOI :
10.1109/NAFIPS.2004.1337356
Filename :
1337356
Link To Document :
بازگشت