DocumentCode :
3020971
Title :
Human Focused Video Description
Author :
Khan, Muhammad Usman Ghani ; Zhang, Lei ; Gotoh, Yoshihiko
Author_Institution :
Univ. of Sheffield, Sheffield, UK
fYear :
2011
fDate :
6-13 Nov. 2011
Firstpage :
1480
Lastpage :
1487
Abstract :
This contribution addresses the generation of natural language descriptions for human actions and behaviour observed in video streams. The work starts with the implementation of conventional image processing techniques to extract high-level features from video. Because humans are often the most important and interesting feature, the description focuses on humans and their activities. Although the feature extraction processes are error-prone at various levels, we explore approaches to combine their outputs into a coherent description. Evaluation is made by calculating the overlap similarity score between human-authored and machine-generated descriptions.
Keywords :
feature extraction; natural language processing; video signal processing; high-level feature extraction; human action; human behaviour; human focused video description; image processing; natural language description; video stream; Face; Feature extraction; Humans; Legged locomotion; Natural languages; Streaming media; Visualization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-0062-9
Type :
conf
DOI :
10.1109/ICCVW.2011.6130425
Filename :
6130425