Title :
Research on Application of Improved Text Cluster Algorithm in Intelligent QA System
Author :
Zhao, Ming ; Wang, Jianli ; Fan, Guanjun
Author_Institution :
Coll. of Comput. Sci., Yangtze Univ., Jingzhou
Abstract :
With rapid development of Internet information, It is quite an important project for data mining that how to classify these large amounts of texts. In this paper, we propose an improved text classify cluster algorithm, while calculating similarity, we synthetically consider the relationship between keywords and eigenvector representation on base of term frequency statistics, thereby it lessens sensitivity of input sequence and frequency, and effectively raises similarity accuracy of small text and simple sentence as well as preciseness and recall rate of text cluster result.
Keywords :
data mining; eigenvalues and eigenfunctions; pattern classification; pattern clustering; quality assurance; statistical analysis; text analysis; Internet information; data mining; eigenvector representation; intelligent QA system; term frequency statistics; text classify cluster algorithm; text cluster algorithm; Application software; Clustering algorithms; Computer applications; Computer science; Educational institutions; Frequency; Genetics; Intelligent systems; Partitioning algorithms; Statistics; Distance function; Similar coefficient; Text Cluster; data mining; intelligent QA system;
Conference_Titel :
Genetic and Evolutionary Computing, 2008. WGEC '08. Second International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-0-7695-3334-6
DOI :
10.1109/WGEC.2008.49