• DocumentCode
    234640
  • Title

    Lexical similarity using fuzzy Euclidean distance

  • Author

    Ayeldeen, Heba ; Hassanien, Aboul Ella ; Fahmy, Aly A.

  • Author_Institution
    Fac. of Comput. & Inf., Cairo Univ., Cairo, Egypt
  • fYear
    2014
  • fDate
    19-20 April 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Knowledge exaction and text representation are considered as the main concepts concerning organizations nowadays. The estimation of the semantic similarity between words provides a valuable method to enable the understanding of texts. In the field of biomedical domains, using Ontologies have been very effective due to their scalability and efficiency. In this paper, we aim to cluster and classify medical thesis data to better discover the commonalities between theses data and hence, improve the accuracy of the similarity estimation which in return improves the scientific research sector. Experimental evaluations using 4,878 theses data set in the medical sector at Cairo University indicate that the proposed approach yields results that correlate more closely with human assessments than other by using the standard ontology (MeSH). Two different algorithms were used; the first is Lexical similarity and then applying K-means clustering and the second is fuzzy Euclidean distance clustering algorithm after using MeSH ontology on medical theses data for better categorization of the keywords within the data.
  • Keywords
    data mining; fuzzy set theory; medical computing; ontologies (artificial intelligence); pattern clustering; text analysis; Cairo University; MeSH ontology; biomedical domain; fuzzy Euclidean distance clustering algorithm; human assessments; k-means clustering; knowledge exaction; lexical similarity; medical sector; medical theses data; medical thesis data; scientific research sector; semantic similarity; similarity estimation; standard ontology; text representation; Clustering algorithms; Data mining; Educational institutions; Euclidean distance; Ontologies; Organizations; Surgery; Classification; Clustering; Fuzzy Euclidean distance Algorithm; Lexical similarity; MeSH ontology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering and Technology (ICET), 2014 International Conference on
  • Conference_Location
    Cairo
  • Type

    conf

  • DOI
    10.1109/ICEngTechnol.2014.7016801
  • Filename
    7016801