• DocumentCode
    1203367
  • Title

    A Fuzzy Ontological Knowledge Document Clustering Methodology

  • Author

    Trappey, Amy J C ; Trappey, Charles V. ; Hsu, Fu-Chiang ; Hsiao, David W.

  • Author_Institution
    Dept. of Ind. Eng. & Manage., Nat. Taipei Univ. of Technol., Taipei
  • Volume
    39
  • Issue
    3
  • fYear
    2009
  • fDate
    6/1/2009 12:00:00 AM
  • Firstpage
    806
  • Lastpage
    814
  • Abstract
    This correspondence presents a novel hierarchical clustering approach for knowledge document self-organization, particularly for patent analysis. Current keyword-based methodologies for document content management tend to be inconsistent and ineffective when partial meanings of the technical content are used for cluster analysis. Thus, a new methodology to automatically interpret and cluster knowledge documents using an ontology schema is presented. Moreover, a fuzzy logic control approach is used to match suitable document cluster(s) for given patents based on their derived ontological semantic webs. Finally, three case studies are used to test the approach. The first test case analyzed and clustered 100 patents for chemical and mechanical polishing retrieved from the World Intellectual Property Organization (WIPO). The second test case analyzed and clustered 100 patent news articles retrieved from online Web sites. The third case analyzed and clustered 100 patents for radio-frequency identification retrieved from WIPO. The results show that the fuzzy ontology-based document clustering approach outperforms the K-means approach in precision, recall, F-measure, and Shannon´s entropy.
  • Keywords
    data mining; document handling; fuzzy set theory; ontologies (artificial intelligence); patents; pattern clustering; fuzzy inference control; fuzzy ontological knowledge document clustering methodology; hierarchical clustering; ontology; text mining; Fuzzy inference control; hierarchical clustering; ontology schema; patent analysis; text mining;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2008.2009463
  • Filename
    4804715