DocumentCode :
1585900
Title :
Order-based clustering using formal concept analysis
Author :
Moriki, Akinori ; Yoshida, Shinichi
Author_Institution :
Grad. Sch. of Eng., Kochi Univ. of Technol., Kochi, Japan
fYear :
2010
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we propose a document clustering algorithm based on formal concept analysis. Conventional clustering methods require numeric data: words or documents are represented as vectors and compared numerically, for example by cosine distance. However, when documents are classified by such methods, some documents within a cluster are not similar to one another. We therefore propose a novel clustering method that applies formal concept analysis. Formal concept analysis classifies documents into sets of documents that share the same features, and the method allows each such set to be selected. The resulting document clustering is suited to expressing the themes of documents based on the words they contain. We apply formal concept analysis to 100 English news articles selected from the Reuters-21578 database, and perform document clustering by selecting individual concepts on the concept lattice. The articles belonging to all concepts connected below a selected concept are grouped into one cluster, and each cluster has a shared topic. In addition, selecting a concept on a higher layer merges the clusters of the connected lower layers into a single cluster. The proposed clustering technique can be applied to text classification and summarization.
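The lattice-based selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy context, the function names, and the brute-force enumeration are all assumptions made for illustration, and the sketch assumes the usual lattice orientation in which lower concepts have smaller document sets.

```python
from itertools import combinations

# Hypothetical toy context: document -> set of index words (not data from the paper).
CONTEXT = {
    "d1": {"oil", "price"},
    "d2": {"oil", "price", "opec"},
    "d3": {"oil", "export"},
    "d4": {"wheat", "export"},
}

def extent(context, words):
    """Documents that contain every word in `words`."""
    return frozenset(d for d, ws in context.items() if words <= ws)

def intent(context, docs):
    """Words shared by every document in `docs` (all words if `docs` is empty)."""
    shared = set(frozenset().union(*context.values()))
    for d in docs:
        shared &= context[d]
    return frozenset(shared)

def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of documents.

    Brute force and exponential in the number of documents; toy contexts only.
    """
    docs = list(context)
    concepts = set()
    for r in range(len(docs) + 1):
        for subset in combinations(docs, r):
            i = intent(context, subset)
            concepts.add((extent(context, i), i))
    return concepts

def cluster_for(concepts, selected):
    """Union of the extents of `selected` and of every concept below it."""
    docs = set()
    for ext, _ in concepts:
        if ext <= selected[0]:  # concept lies at or below the selected one
            docs |= ext
    return docs
```

With this toy context, selecting the concept whose shared words are exactly `{"oil"}` gives the cluster `{"d1", "d2", "d3"}`, which contains the cluster `{"d1", "d2"}` of the lower concept `{"oil", "price"}`. This mirrors the statement that choosing a concept on a higher layer merges the clusters beneath it.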
Keywords :
formal concept analysis; pattern classification; pattern clustering; text analysis; word processing; English news article; cosine distance; document clustering algorithm; formal concept analysis; numeric data processing; order based clustering; text classification; word vector; Biomedical monitoring; Biosensors; Temperature measurement; Temperature sensors; Document Clustering; Formal Concept Analysis; Order Relation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
World Automation Congress (WAC), 2010
Conference_Location :
Kobe
ISSN :
2154-4824
Print_ISBN :
978-1-4244-9673-0
Electronic_ISBN :
2154-4824
Type :
conf
Filename :
5665287