• DocumentCode
    14680
  • Title

    An Iterative Divide-and-Merge-Based Approach for Solving Large-Scale Least Squares Problems

  • Author

    Chi-Yuan Yeh ; Yu-Ting Peng ; Shie-Jue Lee

  • Author_Institution
    Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • Volume
    24
  • Issue
    3
  • fYear
    2013
  • fDate
    Mar-13
  • Firstpage
    428
  • Lastpage
    438
  • Abstract
    Singular value decomposition (SVD) is a popular decomposition method for solving least squares estimation (LSE) problems. However, for large data sets, applying SVD directly on the coefficient matrix is very time consuming and memory demanding in obtaining least squares solutions. In this paper, we propose an iterative divide-and-merge-based estimator for solving large-scale LSE problems. Iteratively, the LSE problem to be solved is processed and transformed to equivalent but smaller LSE problems. In each iteration, the input matrices are subdivided into a set of small submatrices. The submatrices are decomposed by SVD, respectively, and the results are merged, and the resulting matrices become the input of the next iteration. The process is iterated until the resulting matrices are small enough which can then be solved directly and efficiently by SVD. The number of iterations required is determined dynamically according to the size of the input data set. As a result, the requirements in time and space for finding least squares solutions are greatly improved. Furthermore, the decomposition and merging of the submatrices in each iteration can be independently done in parallel. The idea can be easily implemented in MapReduce and experimental results show that the proposed approach can solve large-scale LSE problems effectively.
  • Keywords
    iterative methods; least squares approximations; parallel processing; singular value decomposition; LSE problems; MapReduce; SVD; coefficient matrix; iterative divide-and-merge-based approach; iterative divide-and-merge-based estimator; large-scale least square estimation problems; singular value decomposition; submatrix decomposition; Approximation algorithms; Complexity theory; Educational institutions; Equations; Iterative methods; Least squares approximation; Matrix decomposition; Linear system; MapReduce; batch SVD; error minimization; large-scale data set; least squares solution; matrix decomposition;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2012.161
  • Filename
    6205747