DocumentCode :
3578638
Title :
A mahout based image classification framework for very large dataset
Author :
Jun He ; Zhi-Yun Xue ; Ming-Wei Gao ; Hao Wu
Author_Institution :
Sch. of Electron. & Inf. Eng., Nanjing Univ. of Inf. Sci. & Technol., Nanjing, China
fYear :
2014
Firstpage :
119
Lastpage :
122
Abstract :
In this paper, we present a distributed computing framework for image classification towards the current challenge of image big data due to enormous streaming image data sources, such as image sharing over online social network and massive video surveillance streams from ubiquitous cameras all over our daily life. The proposed framework consists of four modules aiming at feature extraction, dimension reduction, bag of feature modeling, and supervised learning respectively. This distributed computing framework is implemented on Hadoop with Mahout support. We apply the framework for classifying whether a person is on calling or not in a surveillance video to verify the correctness and scalability.
Keywords :
Big Data; feature extraction; image classification; learning (artificial intelligence); parallel processing; visual databases; Hadoop; Mahout based image classification framework; Mahout support; bag of feature modeling; dimension reduction; distributed computing framework; feature extraction; image Big Data; image data sources streaming; image sharing; massive video surveillance streams; online social network; supervised learning; ubiquitous cameras; very large dataset; Accuracy; Classification algorithms; Clustering algorithms; Computational modeling; Multicore processing; Predictive models; Principal component analysis; Bag of Feature; Big Data; Image classification; Map-Reduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing and Internet of Things (CCIOT), 2014 International Conference on
Print_ISBN :
978-1-4799-4765-2
Type :
conf
DOI :
10.1109/CCIOT.2014.7062518
Filename :
7062518
Link To Document :
بازگشت