Title :
Document Clustering in Personal Dataspace
Author :
Liu, Debao ; Yang, Dan ; Nie, Tiezheng ; Kou, Yue ; Shen, Derong
Author_Institution :
Dept. of Comput. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
In Personal Dataspace (PDS), documents containing a lot of useful information play an important role in our daily work. However, it is difficult to manage the information in these documents efficiently. In this paper, we first extract some frequent terms from documents, and then cluster these documents based on the terms. Thus users can query the documents based on their contents conveniently. The experiments demonstrate the accuracy and efficiency of the key techniques in our approach.
Keywords :
database management systems; document handling; information management; knowledge acquisition; query processing; document clustering; frequent term extraction; information management; personal dataspace; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Contracts; Data mining; Databases; Merging; FP-tree; dataspaces; document clustering;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2010 7th
Conference_Location :
Hohhot
Print_ISBN :
978-1-4244-8440-9
DOI :
10.1109/WISA.2010.16