DocumentCode :
130978
Title :
Finding dimensions for text based on heterogeneous information network
Author :
Fei Jiang ; Xiaoguang Hong ; Zhaohui Peng ; Qingzhong Li
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear :
2014
fDate :
27-29 June 2014
Firstpage :
819
Lastpage :
823
Abstract :
We propose an approach applicable in the problem of multi dimensions text mining that finds out several sets of phrases which were referred to as the text dimension. Based on the dimensions of text found by the proposed approach, a network could be built by similarities between documents. A method is proposed to transform the network from a coarse-grained one to a fine-grained one. By repeatedly mining phrases sets from the networks of different granularities, we could get a refined text dimensions set. We provide experimental results on text mining showing the computational feasibility and effectiveness for finding text dimensions which combines text mining with network mining and can be used for learning interesting knowledge.
Keywords :
data mining; information networks; text analysis; heterogeneous information network; multidimensions text mining; text dimension; Clustering algorithms; Communities; Data mining; Databases; Feature extraction; Image edge detection; Partitioning algorithms; heterogeneous information network; information network analysis method; network mining; text dimension;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location :
Beijing
ISSN :
2327-0586
Print_ISBN :
978-1-4799-3278-8
Type :
conf
DOI :
10.1109/ICSESS.2014.6933692
Filename :
6933692
Link To Document :
بازگشت