DocumentCode
2358961
Title
A New Hierarchical Document Clustering Method
Author
Kou, Gang ; Peng, Yi
Author_Institution
Sch. of Manage. & Econ., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
fYear
2009
fDate
25-27 Aug. 2009
Firstpage
1789
Lastpage
1792
Abstract
The advances in digital data collection and storage technologies during the last two decades allow companies and organizations store up huge amounts of electronic documents. Large collections of electronic text present opportunities and challenges. How to assist users to find the most relevant documents from vast text collections efficiently is one of the challenges. This study proposes a hierarchical clustering method to efficiently label documents that satisfy users´ information needs. An experiment was conducted to examine the proposed method and the results shown that the clustering method is effective and efficient, in terms of both objective and subjective measures.
Keywords
information needs; information retrieval; information storage; pattern clustering; text analysis; digital data collection technologies; digital data storage technologies; document labeling; electronic documents; hierarchical document clustering method; user information needs; Abstracts; Clustering algorithms; Clustering methods; Conference management; Content addressable storage; Information retrieval; Statistics; Technology management; Text mining; Unsupervised learning; document clustering; hierachical algorithm; information retrieval; k-means; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4244-5209-5
Electronic_ISBN
978-0-7695-3769-6
Type
conf
DOI
10.1109/NCM.2009.126
Filename
5331371
Link To Document