DocumentCode :
344292
Title :
Analysis of processes and large data sets by a self-organizing method
Author :
Kohonen, Teuvo
Author_Institution :
Neural Networks Res. Centre, Helsinki Univ. of Technol., Espoo, Finland
Volume :
1
fYear :
1999
fDate :
36342
Firstpage :
27
Abstract :
Frequently one must deal with natural processes and data for which no known models can be derived from classical systems theory. A solution is that relationships between the elements are described by nonlinear functional expansions called “neural networks”. The most familiar neural-network models make use of supervised learning, which means that the data used for identification must be verified, validated, and preclassified. Such data, however, is very expensive and sometimes even impossible to acquire. A different approach altogether is unsupervised learning that uses raw data, usually available on mass. In the article, the most widespread unsupervised-learning method, the self-organizing map (SOM) algorithm is described. The central idea in this algorithm and in self organization in general, is to use a large number of relatively simple and structurally similar, interacting, statistical submodels. Each submodel describes only a limited domain of observations, but since the submodels can communicate, they can mutually decide what and how large a domain belongs to each submodel. By virtue of such collective interactions it becomes possible to span the whole data space nonlinearly, thereby minimizing the average overall modeling error. As the SOM implements a characteristic nonlinear projection from the input space to a visual display, it can be used, e.g., to reveal process states that otherwise would escape notice. Applications to industry and “data mining” in general are surveyed. The mapping of all electronically available patent abstracts in the world onto a visual display is also reported
Keywords :
self-organising feature maps; unsupervised learning; characteristic nonlinear projection; collective interactions; data mining; data space; input space; large data sets; natural processes; neural networks; nonlinear functional expansions; patent abstracts; raw data; self-organizing map algorithm; self-organizing method; statistical submodels; unsupervised-learning method; visual display; Biomedical imaging; Computer networks; Concurrent computing; Data visualization; Displays; Humans; Medical diagnostic imaging; Neural networks; Speech processing; Supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Processing and Manufacturing of Materials, 1999. IPMM '99. Proceedings of the Second International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
0-7803-5489-3
Type :
conf
DOI :
10.1109/IPMM.1999.792450
Filename :
792450
Link To Document :
بازگشت