Title :
FHAR: A New Text Association Rule Algorithm Based on Concept Vector and Its Application
Author :
Xinqing Geng ; Fengmei Tao
Author_Institution :
Coll. of Math. & Inf. Sci., Anshan Normal Univ., Anshan, China
Abstract :
A novel text association rule approach FHAR algorithm is presented. To overcome the defect of traditional keywords which does not take into account the semantic relation between keywords, FHAR algorithm in the paper is based on concept vector. The density of semantic field and the weight of meaning are used to determine the concept of the keywords, which not only adds the texts semantic, but also reduces vector dimensions, FHAR algorithm adopts improved HASH table for efficient large item set generation. The stored address of item sets is determined by a new hash function. Based on the new hash table, tree structure is constructed. When FHAR algorithm is applied to text mining, the text association rule is derived. Experiments show FHAR algorithm possesses higher efficiency and accuracy than Apriori algorithm.
Keywords :
data mining; semantic networks; text analysis; tree data structures; FHAR algorithm; HASH table; concept vector; hash function; itemset generation; keywords; semantic field density; text association rule algorithm; text mining; tree structure; Algorithm design and analysis; Association rules; Educational institutions; Itemsets; Semantics; Text mining; Vectors; association rule; concept vector; text mining;
Conference_Titel :
Multimedia Information Networking and Security (MINES), 2012 Fourth International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4673-3093-0
DOI :
10.1109/MINES.2012.113