Title :
Key frame extraction using MPEG-7 motion descriptors
Author :
Narasimha, R. ; Savakis, A. ; Rao, R.M. ; De Queiroz, R.
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
We address the problem of key frame extraction in the compressed domain that is of great importance in content-based system applications. A novel MPEG-7 motion activity descriptor is discussed that is a combination of temporal and spatial descriptors. These descriptors represent both temporal motion intensity as well as spatial distribution of motion activity. It is assumed that the apriori information about the shot boundaries is available. The temporal descriptors are obtained by classifying the shots into five different intensity levels based on fuzzy membership functions. A high value of intensity indicates high activity and a low value of intensity indicates low activity. The spatial descriptors are obtained using motion vectors. The individual frames are characterized into spatial regions depending on the change in motion activity between successive frames. The main motivation behind this approach is to pick those frames as key frames that have maximum centralized spatial activity and high motion intensity. The motion intensity and spatial distribution are then fed to a neural network that decides the key frames based on maximum temporal activity and centralized spatial distribution. Results illustrate that the proposed approach is computationally less intensive once the network is trained and works much better than selecting the first frame and middle frame of the shot as key frame for a wide range of video sequences.
Keywords :
feature extraction; neural nets; MPEG-7 motion descriptors; apriori information; content-based system applications; fuzzy membership functions; key frame extraction; neural network; shot boundaries; spatial distribution; temporal motion intensity; temporal-spatial descriptors; video sequences; Application software; Computer networks; Content based retrieval; Content management; Drives; Information retrieval; Internet; MPEG 7 Standard; Neural networks; Video sequences;
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Seventh Asilomar Conference on
Print_ISBN :
0-7803-8104-1
DOI :
10.1109/ACSSC.2003.1292250