DocumentCode
1955321
Title
Improving Semantic Scene Categorization by Exploiting Audio-Visual Features
Author
Zhu, Songhao ; Yan, Junchi ; Liu, Yuncai
Author_Institution
Shanghai Jiao Tong Univ., Shanghai, China
fYear
2009
fDate
20-23 Sept. 2009
Firstpage
435
Lastpage
440
Abstract
We address the problem of categorizing scenes from feature films into semantic classes based on audio-visual cues. Specifically, we first exploit the grammar of film production to specify the semantic content of scenes. Then, each scene is classified into one of the following categories: conversation, action, and suspense. Finally, to obtain more specific scene categories that are consistent with human perception, conversation scenes are further categorized into emotional and common conversations, and action scenes are further categorized into gunfight, beating, and chasing scenes. This work is a step toward effective and efficient browsing and retrieval of feature-film content under limited bandwidth and in video repositories, and toward rating feature films of interest.
Keywords
content-based retrieval; emotion recognition; feature extraction; pattern classification; video signal processing; audio visual cues; audio visual features; film production; human perception; retrieval content; semantic classifications; semantic content; semantic scene categorization; video repository; Cameras; Content based retrieval; Face detection; Graphics; Humans; Image retrieval; Information analysis; Layout; Motion pictures; Production;
fLanguage
English
Publisher
IEEE
Conference_Titel
Image and Graphics, 2009. ICIG '09. Fifth International Conference on
Conference_Location
Xi'an, Shaanxi
Print_ISBN
978-1-4244-5237-8
Type
conf
DOI
10.1109/ICIG.2009.17
Filename
5437899