Title : 
Finding structure in home videos by probabilistic hierarchical clustering
         
        
            Author : 
Gatica-Perez, Daniel ; Loui, Alexander ; Sun, Ming-Ting
         
        
            Author_Institution : 
Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
         
        
        
        
        
            fDate : 
6/1/2003 12:00:00 AM
         
        
        
        
            Abstract : 
Accessing, organizing, and manipulating home videos present technical challenges due to their unrestricted content and lack of storyline. We present a methodology to discover cluster structure in home videos, which uses video shots as the unit of organization, and is based on two concepts: (1) the development of statistical models of visual similarity, duration, and temporal adjacency of consumer video segments and (2) the reformulation of hierarchical clustering as a sequential binary Bayesian classification process. A Bayesian formulation allows for the incorporation of prior knowledge of the structure of home video and offers the advantages of a principled methodology. Gaussian mixture models are used to represent the class-conditional distributions of intra- and inter-segment visual and temporal features. The models are then used in the probabilistic clustering algorithm, where the merging order is a variation of highest confidence first, and the merging criterion is maximum a posteriori. The algorithm does not need any ad-hoc parameter determination. We present extensive results on a 10-h home-video database with ground truth which thoroughly validate the performance of our methodology with respect to cluster detection, individual shot-cluster labeling, and the effect of prior selection.
         
        
            Keywords : 
Bayes methods; feature extraction; image classification; image segmentation; pattern clustering; probability; statistical analysis; video databases; video signal processing; Bayesian formulation; Gaussian mixture models; class-conditional distributions; cluster detection; consumer video segments; duration; ground truth; home videos structure; home-video database; inter-segment temporal features; inter-segment visual features; intra-segment temporal features; intra-segment visual features; maximum a posteriori merging criterion; probabilistic clustering algorithm; probabilistic hierarchical clustering; sequential binary Bayesian classification; shot-cluster labeling; statistical models; temporal adjacency; video shots; video-segment feature extraction; video-segment feature selection; visual similarity; Bayesian methods; Clustering algorithms; Decision theory; Labeling; Merging; Organizing; Spatial databases; Sun; Video recording; Visual databases;
         
        
        
            Journal_Title : 
Circuits and Systems for Video Technology, IEEE Transactions on
         
        
        
        
        
            DOI : 
10.1109/TCSVT.2003.813428