مرکز منطقه ای اطلاع رساني علوم و فناوري - ActivityNet: A large-scale video benchmark for human activity understanding

DocumentCode :

3672161

Title :

ActivityNet: A large-scale video benchmark for human activity understanding

Author :

Fabian Caba Heilbron;Victor Escorcia;Bernard Ghanem;Juan Carlos Niebles

Author_Institution :

Universidad del Norte, Colombia

fYear :

2015

fDate :

6/1/2015 12:00:00 AM

Firstpage :

961

Lastpage :

970

Abstract :

In spite of many dataset efforts for human action recognition, current computer vision algorithms are still severely limited in terms of the variability and complexity of the actions that they can recognize. This is in part due to the simplicity of current benchmarks, which mostly focus on simple actions and movements occurring on manually trimmed videos. In this paper we introduce ActivityNet, a new large-scale video benchmark for human activity understanding. Our benchmark aims at covering a wide range of complex human activities that are of interest to people in their daily living. In its current version, ActivityNet provides samples from 203 activity classes with an average of 137 untrimmed videos per class and 1.41 activity instances per video, for a total of 849 video hours. We illustrate three scenarios in which ActivityNet can be used to compare algorithms for human activity understanding: untrimmed video classification, trimmed activity classification and activity detection.

Keywords :

"Benchmark testing","Taxonomy","Cleaning","Semantics","Organizations","Complexity theory","YouTube"

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2015.7298698

Filename :

7298698

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3672161