• DocumentCode
    538555
  • Title

    Obtaining term similarities on concept extraction study

  • Author

    Balkan, Kerime ; Takçi, Hidayet

  • Author_Institution
    Gebze Yuksek Teknoloji Enstitusu, Gebze, Turkey
  • fYear
    2010
  • fDate
    2-5 Dec. 2010
  • Firstpage
    578
  • Lastpage
    582
  • Abstract
    Concept extraction work, promises to improve the performance of the term-based text mining which has high complexity. The first phase of the concept extraction is to detect the terms have notable frequency to represent the documents. With grouping these terms an important function will be implemented on the way conception. Transition from terms to concepts; by clustering the terms according to similarities between terms, and then by labeling these clusters with an expert. The parameters of clustering algorithm and the quality of the data set will affect the success of this process. In this study, the three methods for term similarity are examined and the the most successful one is tried to find. Study is performed on Turkish language.
  • Keywords
    data mining; natural language processing; pattern clustering; text analysis; Turkish language; clustering algorithm; concept extraction study; term-based text mining; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Digital signal processing; Information retrieval; Semantics; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical, Electronics and Computer Engineering (ELECO), 2010 National Conference on
  • Conference_Location
    Bursa
  • Print_ISBN
    978-1-4244-9588-7
  • Electronic_ISBN
    978-605-01-0013-6
  • Type

    conf

  • Filename
    5698108