DocumentCode
226496
Title
On the creation of a fuzzy dataset for the evaluation of fuzzy semantic similarity measures
Author
Chandran, Daniel ; Crockett, Keeley ; McLean, D.
Author_Institution
Sch. of Comput., Math. & Digital Technol., Manchester Metropolitan Univ., Manchester, UK
fYear
2014
fDate
6-11 July 2014
Firstpage
752
Lastpage
759
Abstract
Short text semantic similarity (STSS) measures are algorithms designed to compare short texts and return a level of similarity between them. However, until recently such measures have ignored perception or fuzzy based words (i.e. very hot, cold less cold) in calculations of both word and sentence similarity. Evaluation of such measures is usually achieved through the use of benchmark data sets comprising of a set of rigorously collected sentence pairs which have been evaluated by human participants. A weakness of these datasets is that the sentences pairs include limited, if any, fuzzy based words that makes them impractical for evaluating fuzzy sentence similarity measures. In this paper, a method is presented for the creation of a new benchmark dataset known as SFWD (Single Fuzzy Word Dataset). After creation the data set is then used in the evaluation of FAST, an ontology based fuzzy algorithm for semantic similarity testing that uses concepts of fuzzy and computing with words to allow for the accurate representation of fuzzy based words. The SFWD is then used to undertake a comparative analysis of other established STSS measures.
Keywords
data handling; fuzzy set theory; natural language processing; ontologies (artificial intelligence); STSS measurement; benchmark data sets; fuzzy algorithm; fuzzy based words; fuzzy semantic similarity measures; natural language processing; ontology; short text semantic similarity; single fuzzy word dataset; Atmospheric measurements; Natural languages; Ontologies; Particle measurements; Semantics; Vectors; Velocity measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ-IEEE), 2014 IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-2073-0
Type
conf
DOI
10.1109/FUZZ-IEEE.2014.6891571
Filename
6891571
Link To Document