Title :
Clustering and classification in structured data domains using Fuzzy Lattice Neurocomputing (FLN)
Author :
Petridis, Vassilios ; Kaburlasos, Vassilis G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Aristotelian Univ. of Thessaloniki, Greece
Abstract :
A connectionist scheme, namely, σ-Fuzzy Lattice Neurocomputing scheme or σ-FLN for short, which has been introduced in the literature lately for clustering in a lattice data domain, is employed for computing clusters of directed graphs in a master-graph. New tools are presented and used, including a convenient inclusion measure function for clustering graphs. A directed graph is treated by σ-FLN as a single datum in the mathematical lattice of subgraphs stemming from a master-graph. A series of experiments is detailed where the master-graph emanates from a thesaurus of spoken language synonyms. The words of the thesaurus are fed to σ-FLN in order to compute clusters of semantically related words, namely hyperwords. The arithmetic parameters of σ-FLN can be adjusted so as to calibrate the total number of hyperwords computed in a specific application. It is demonstrated how the employment of hyperwords implies a reduction, based on the a priori knowledge of semantics contained in the thesaurus, in the number of features to be used for document classification. In a series of comparative experiments for document classification, it appears that the proposed method favorably improves classification accuracy in problems involving longer documents, whereas performance deteriorates in problems involving short documents
Keywords :
data handling; directed graphs; document handling; fuzzy neural nets; linguistics; natural languages; pattern clustering; thesauri; σ-FLN; σ-Fuzzy Lattice Neurocomputing scheme; FLN; Fuzzy Lattice Neurocomputing; a priori knowledge; arithmetic parameters; classification; classification accuracy; clustering; clustering graphs; connectionist scheme; directed graphs; document classification; hyperwords; inclusion measure function; lattice data domain; longer documents; master-graph; mathematical lattice; semantically related words; semantics; short documents; spoken language synonyms; structured data domains; subgraphs; thesaurus; Arithmetic; Data processing; Employment; Function approximation; Fuzzy neural networks; Lattices; Natural languages; Neural networks; Parallel processing; Thesauri;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on