DocumentCode
538555
Title
Obtaining term similarities on concept extraction study
Author
Balkan, Kerime ; Takçi, Hidayet
Author_Institution
Gebze Yuksek Teknoloji Enstitusu, Gebze, Turkey
fYear
2010
fDate
2-5 Dec. 2010
Firstpage
578
Lastpage
582
Abstract
Concept extraction work, promises to improve the performance of the term-based text mining which has high complexity. The first phase of the concept extraction is to detect the terms have notable frequency to represent the documents. With grouping these terms an important function will be implemented on the way conception. Transition from terms to concepts; by clustering the terms according to similarities between terms, and then by labeling these clusters with an expert. The parameters of clustering algorithm and the quality of the data set will affect the success of this process. In this study, the three methods for term similarity are examined and the the most successful one is tried to find. Study is performed on Turkish language.
Keywords
data mining; natural language processing; pattern clustering; text analysis; Turkish language; clustering algorithm; concept extraction study; term-based text mining; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Digital signal processing; Information retrieval; Semantics; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical, Electronics and Computer Engineering (ELECO), 2010 National Conference on
Conference_Location
Bursa
Print_ISBN
978-1-4244-9588-7
Electronic_ISBN
978-605-01-0013-6
Type
conf
Filename
5698108
Link To Document