مرکز منطقه ای اطلاع رساني علوم و فناوري - Research on Text Clustering Algorithm Based on K

DocumentCode :

2104596

Title :

Research on Text Clustering Algorithm Based on K_means and SOM

Author :

Xinwu, Li

Author_Institution :

Electron. Bus. Dept., Jiangxi Univ. of Finance & Econ., Nanchang

fYear :

2008

fDate :

21-22 Dec. 2008

Firstpage :

341

Lastpage :

344

Abstract :

Text clustering is one of the difficult and hot research fields in the Internet search engine research. Combination the advantages of k-means clustering and self-organizing model (SOM) techniques, a new text clustering algorithm is presented. Firstly, texts are preprocessed to satisfy succeed process. Then, the paper analyzes common k-means clustering algorithm and SOM algorithm and combines them to overcome efficiency of low stability of k-means algorithm which is very sensitive to the initial cluster center and the isolated point text. The experimental results indicate that the improved algorithm has a higher accuracy and has a better stability, compared with the original algorithm.

Keywords :

Internet; pattern clustering; search engines; self-organising feature maps; text analysis; Internet search engine research; k-means clustering; self-organizing model technique; text clustering algorithm; Algorithm design and analysis; Clustering algorithms; Finance; Information retrieval; Information technology; Internet; Iterative algorithms; Partitioning algorithms; Search engines; Stability analysis; K-means clustering; Self-Organizing Model; Text Clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Information Technology Application Workshops, 2008. IITAW '08. International Symposium on

Conference_Location :

Shanghai

Print_ISBN :

978-0-7695-3505-0

Type :

conf

DOI :

10.1109/IITA.Workshops.2008.13

Filename :

4731947

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2104596