DocumentCode :
3007597
Title :
Research on Application of Improved Text Cluster Algorithm in Intelligent QA System
Author :
Zhao, Ming ; Wang, Jianli ; Fan, Guanjun
Author_Institution :
Coll. of Comput. Sci., Yangtze Univ., Jingzhou
fYear :
2008
fDate :
25-26 Sept. 2008
Firstpage :
463
Lastpage :
466
Abstract :
With rapid development of Internet information, It is quite an important project for data mining that how to classify these large amounts of texts. In this paper, we propose an improved text classify cluster algorithm, while calculating similarity, we synthetically consider the relationship between keywords and eigenvector representation on base of term frequency statistics, thereby it lessens sensitivity of input sequence and frequency, and effectively raises similarity accuracy of small text and simple sentence as well as preciseness and recall rate of text cluster result.
Keywords :
data mining; eigenvalues and eigenfunctions; pattern classification; pattern clustering; quality assurance; statistical analysis; text analysis; Internet information; data mining; eigenvector representation; intelligent QA system; term frequency statistics; text classify cluster algorithm; text cluster algorithm; Application software; Clustering algorithms; Computer applications; Computer science; Educational institutions; Frequency; Genetics; Intelligent systems; Partitioning algorithms; Statistics; Distance function; Similar coefficient; Text Cluster; data mining; intelligent QA system;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genetic and Evolutionary Computing, 2008. WGEC '08. Second International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-0-7695-3334-6
Type :
conf
DOI :
10.1109/WGEC.2008.49
Filename :
4637486
Link To Document :
بازگشت