Title :
Fast online L1-dictionary learning algorithms for novel document detection
Author :
Kasiviswanathan, Shiva Prasad
Author_Institution :
Gen. Electr. Res., San Ramon, CA, USA
Abstract :
Online L1-dictionary learning, introduced by Kasiviswanathan et al. [1], is the process of generating a sequence of (dictionary) matrices {At+1}, one at a time, for t = 0, 1,.... After committing to At+1, a pair of matrices (Pt+1,Xt+1) is revealed and the online algorithm incurs a cost of ∥Pt+1 - At+1Xt+1∥1. The goal of the online algorithm is to ensure that the total cost up to each time is not much larger than the smallest total cost of any fixed A chosen with the benefit of hindsight. In this paper, we study three different algorithms for this problem based on the schemes of dual averaging, projected gradient, and alternating direction method of multipliers. We focus on the performance of these algorithms for the application of novel document detection, where online dictionary learning could be used to automatically identify emerging topics of discussion from a voluminous stream of text documents in a scalable manner. Our empirical results show the relative benefits of these three algorithms for this application.
Keywords :
document handling; learning (artificial intelligence); matrix algebra; text detection; alternating multiplier direction method; document detection; dual averaging schemes; fast online L1-dictionary learning algorithms; matrix sequence; projected gradient schemes; text documents; Algorithm design and analysis; Cost function; Dictionaries; Encoding; Media; Prediction algorithms; Sparse matrices; Dictionary Learning; Online Algorithms; Sparse Coding; Topic Detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639341