DocumentCode
14680
Title
An Iterative Divide-and-Merge-Based Approach for Solving Large-Scale Least Squares Problems
Author
Chi-Yuan Yeh ; Yu-Ting Peng ; Shie-Jue Lee
Author_Institution
Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
Volume
24
Issue
3
fYear
2013
fDate
Mar-13
Firstpage
428
Lastpage
438
Abstract
Singular value decomposition (SVD) is a popular decomposition method for solving least squares estimation (LSE) problems. However, for large data sets, applying SVD directly on the coefficient matrix is very time consuming and memory demanding in obtaining least squares solutions. In this paper, we propose an iterative divide-and-merge-based estimator for solving large-scale LSE problems. Iteratively, the LSE problem to be solved is processed and transformed to equivalent but smaller LSE problems. In each iteration, the input matrices are subdivided into a set of small submatrices. The submatrices are decomposed by SVD, respectively, and the results are merged, and the resulting matrices become the input of the next iteration. The process is iterated until the resulting matrices are small enough which can then be solved directly and efficiently by SVD. The number of iterations required is determined dynamically according to the size of the input data set. As a result, the requirements in time and space for finding least squares solutions are greatly improved. Furthermore, the decomposition and merging of the submatrices in each iteration can be independently done in parallel. The idea can be easily implemented in MapReduce and experimental results show that the proposed approach can solve large-scale LSE problems effectively.
Keywords
iterative methods; least squares approximations; parallel processing; singular value decomposition; LSE problems; MapReduce; SVD; coefficient matrix; iterative divide-and-merge-based approach; iterative divide-and-merge-based estimator; large-scale least square estimation problems; singular value decomposition; submatrix decomposition; Approximation algorithms; Complexity theory; Educational institutions; Equations; Iterative methods; Least squares approximation; Matrix decomposition; Linear system; MapReduce; batch SVD; error minimization; large-scale data set; least squares solution; matrix decomposition;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2012.161
Filename
6205747
Link To Document