• DocumentCode
    226830
  • Title

    Post-processing of unsupervised dictionary learning in handwritten digit recognition

  • Author

    Phaisangittisagul, Ekachai ; Chongprachawat, Rapeepol

  • Author_Institution
    Dept. of Electr. Eng., Kasetsart Univ., Bangkok, Thailand
  • fYear
    2014
  • fDate
    24-26 Sept. 2014
  • Firstpage
    166
  • Lastpage
    170
  • Abstract
    To achieve high performance in object recognition, a high-level feature representation is play an essential role to transform a raw input data (low-level) into a new representation. Unsupervised feature learning is one of the most successful methods that is widely used in machine learning literatures for creating a high-level feature to improve the supervised learning problems. The main concept of unsupervised feature learning is, in some sense, to encode input knowledge for gaining not only latent features but compact high-fidelity representation. In particular, a sparse coding has proven to be an effective tool as a prior lead to state-of-the-art performance in many benchmark datasets. In sparse coding, an input data can be represented as a sparse linear combination of a set of training overcomplete dictionary. However, an open problem in sparse coding is how to create and correctly choose the dictionary for representing a given input. Moreover, some bases of the learned dictionary is highly dependent and non-orthogonal among itself. In this study, we propose a post-processing scheme to transform the overcomplete dictionary obtained from unsupervised feature learning using Principle Component Analysis (PCA). The goal of our post-processing is to reduce the number of bases in the unsupervised dictionary learning for real-time applications. In experimental results, while the performance without post-processing the unsupervised dictionary learning is better than that of using PCA, it takes more computation and requires more resources during the test time.
  • Keywords
    handwriting recognition; object recognition; principal component analysis; unsupervised learning; PCA; benchmark datasets; handwritten digit recognition; high-level feature representation; machine learning literatures; object recognition; principle component analysis; real-time applications; sparse coding; unsupervised dictionary learning; unsupervised feature learning; Dictionaries; Encoding; Neurons; Optimization; Prediction algorithms; Principal component analysis; Vectors; MNIST dataset; dictionary; high-level feature; sparse autoencoder; unlabeled data; unsupervised feature learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technologies (ISCIT), 2014 14th International Symposium on
  • Conference_Location
    Incheon
  • Type

    conf

  • DOI
    10.1109/ISCIT.2014.7011893
  • Filename
    7011893