مرکز منطقه ای اطلاع رساني علوم و فناوري - Parallel Lasso for Large-Scale Video Concept Detection

DocumentCode :

1369158

Title :

Parallel Lasso for Large-Scale Video Concept Detection

Author :

Geng, Bo ; Li, Yangxi ; Tao, Dacheng ; Wang, Meng ; Zha, Zheng-Jun ; Xu, Chao

Author_Institution :

Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China

Volume :

Issue :

fYear :

2012

Firstpage :

Lastpage :

Abstract :

Existing video concept detectors are generally built upon the kernel based machine learning techniques, e.g., support vector machines, regularized least squares, and logistic regression, just to name a few. However, in order to build robust detectors, the learning process suffers from the scalability issues including the high-dimensional multi-modality visual features and the large-scale keyframe examples. In this paper, we propose parallel lasso (Plasso) by introducing the parallel distributed computation to significantly improve the scalability of lasso (the l₁ regularized least squares). We apply the parallel incomplete Cholesky factorization to approximate the covariance statistics in the preprocess step, and the parallel primal-dual interior-point method with the Sherman-Morrison-Woodbury formula to optimize the model parameters. For a dataset with n samples in a d-dimensional space, compared with lasso, Plasso significantly reduces complexities from the original O(d³) for computational time and O(d²) for storage space to O(h²d/m) and O(hd/m) , respectively, if the system has m processors and the reduced dimension h is much smaller than the original dimension d . Furthermore, we develop the kernel extension of the proposed linear algorithm with the sample reweighting schema, and we can achieve similar time and space complexity improvements [time complexity from O(n³) to O(h²n/m) and the space complexity from O(n²) to O(hn/m), for a dataset with n training examples]. Experimental results on TRECVID video concept detection challenges suggest that the proposed method can obtain significant time and space savings for training effective detectors with limited communication overhead.

Keywords :

computational complexity; covariance analysis; feature extraction; learning (artificial intelligence); least squares approximations; matrix decomposition; parallel processing; video signal processing; Sherman-Morrison-Woodbury formula; complexity reduction; computational time; covariance statistics; high-dimensional multimodality visual features; kernel based machine learning technique; kernel extension; large-scale keyframe; large-scale video concept detection; learning process; linear algorithm; logistic regression; parallel distributed computation; parallel incomplete Cholesky factorization; parallel lasso; parallel primal-dual interior-point method; regularized least squares; sample reweighting schema; scalability issue; space complexity; storage space; support vector machine; time complexity; Algorithm design and analysis; Complexity theory; Educational institutions; Feature extraction; Kernel; Machine learning; Optimization; Incomplete cholosky factorization; lasso; parallel learning; video concept detection;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2011.2174781

Filename :

6069863

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1369158