DocumentCode :
2629286
Title :
A multi-module minimization neural network for motion-based scene segmentation
Author :
Chen, Yen-Kuang ; Kung, S.Y.
Author_Institution :
Princeton Univ., NJ, USA
fYear :
1996
fDate :
4-6 Sep 1996
Firstpage :
371
Lastpage :
380
Abstract :
A competitive learning network, called multi-module minimization (MMM) neural network, is proposed for unsupervised classification. Our objective is to provide a general framework to divide a set of input patterns into a number of clusters such that the patterns of the same cluster exhibit any pre-specified similarity measure (i.e. not limited only to RBF). As an example of non-RBF measure, let us look into a motion-based segmentation problem. The image frame can be divided into different regions (segments) each of which is characterized by a consistent affine motion. Algebraically, this leads to an LBF similarity criterion-because each region can be characterized by a 3-dimensional hyperplane. In order to apply the traditional RBF clustering techniques (e.g. VQ, k-mean), it requires a preprocessing step such as taking the Hough transform, which itself creates additional ambiguity. This problem is avoided in a direct approach such as the proposed MMM neural network. It allows us to directly cluster the tracked features into different moving objects by means of an LBF cost function. In general, the primary cost function should be carefully chosen to reflect the true physical model of the application. By minimizing the cost function, we can categorize a set of input patterns into a number of clusters. Because the primary similarity measure is no longer Euclidean type, it may become necessary to take spatial neighborhood into account as a secondary cost function. Still, a third cost function, reflecting the MDL type criterion, needs to be added so that noisy or spurious patterns will not be mistakenly modeled as a meaningful class. Accordingly, we have proposed an EM-type learning algorithm which uses all or part of the three cost functions mentioned above. A convergence proof for this algorithm is provided. Simulation results demonstrate that the MMM neural network does capture different motions and yield fairly accurate segmentation and motion-compensated frames
Keywords :
convergence; image segmentation; neural nets; unsupervised learning; 3-dimensional hyperplane; EM-type learning algorithm; Hough transform; RBF clustering techniques; competitive learning network; consistent affine motion; convergence proof; motion-based scene segmentation; motion-compensated frames; multi-module minimization neural network; primary cost function; primary similarity measure; similarity measure; spatial neighborhood; unsupervised classification; Arithmetic; Clustering algorithms; Clustering methods; Convergence; Cost function; Euclidean distance; Image segmentation; Layout; Neural networks; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks for Signal Processing [1996] VI. Proceedings of the 1996 IEEE Signal Processing Society Workshop
Conference_Location :
Kyoto
ISSN :
1089-3555
Print_ISBN :
0-7803-3550-3
Type :
conf
DOI :
10.1109/NNSP.1996.548367
Filename :
548367
Link To Document :
بازگشت