مرکز منطقه ای اطلاع رساني علوم و فناوري - A multi-module minimization neural network for motion-based scene segmentation

DocumentCode :

2629286

Title :

A multi-module minimization neural network for motion-based scene segmentation

Author :

Chen, Yen-Kuang ; Kung, S.Y.

Author_Institution :

Princeton Univ., NJ, USA

fYear :

1996

fDate :

4-6 Sep 1996

Firstpage :

371

Lastpage :

380

Abstract :

A competitive learning network, called multi-module minimization (MMM) neural network, is proposed for unsupervised classification. Our objective is to provide a general framework to divide a set of input patterns into a number of clusters such that the patterns of the same cluster exhibit any pre-specified similarity measure (i.e. not limited only to RBF). As an example of non-RBF measure, let us look into a motion-based segmentation problem. The image frame can be divided into different regions (segments) each of which is characterized by a consistent affine motion. Algebraically, this leads to an LBF similarity criterion-because each region can be characterized by a 3-dimensional hyperplane. In order to apply the traditional RBF clustering techniques (e.g. VQ, k-mean), it requires a preprocessing step such as taking the Hough transform, which itself creates additional ambiguity. This problem is avoided in a direct approach such as the proposed MMM neural network. It allows us to directly cluster the tracked features into different moving objects by means of an LBF cost function. In general, the primary cost function should be carefully chosen to reflect the true physical model of the application. By minimizing the cost function, we can categorize a set of input patterns into a number of clusters. Because the primary similarity measure is no longer Euclidean type, it may become necessary to take spatial neighborhood into account as a secondary cost function. Still, a third cost function, reflecting the MDL type criterion, needs to be added so that noisy or spurious patterns will not be mistakenly modeled as a meaningful class. Accordingly, we have proposed an EM-type learning algorithm which uses all or part of the three cost functions mentioned above. A convergence proof for this algorithm is provided. Simulation results demonstrate that the MMM neural network does capture different motions and yield fairly accurate segmentation and motion-compensated frames

Keywords :

convergence; image segmentation; neural nets; unsupervised learning; 3-dimensional hyperplane; EM-type learning algorithm; Hough transform; RBF clustering techniques; competitive learning network; consistent affine motion; convergence proof; motion-based scene segmentation; motion-compensated frames; multi-module minimization neural network; primary cost function; primary similarity measure; similarity measure; spatial neighborhood; unsupervised classification; Arithmetic; Clustering algorithms; Clustering methods; Convergence; Cost function; Euclidean distance; Image segmentation; Layout; Neural networks; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks for Signal Processing [1996] VI. Proceedings of the 1996 IEEE Signal Processing Society Workshop

Conference_Location :

Kyoto

ISSN :

1089-3555

Print_ISBN :

0-7803-3550-3

Type :

conf

DOI :

10.1109/NNSP.1996.548367

Filename :

548367

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2629286