• DocumentCode
    4803
  • Title

    REC-STA: Reconfigurable and Efficient Chip Design With SMO-Based Training Accelerator

  • Author

    Chih-Hsiang Peng ; Bo-Wei Chen ; Ta-Wen Kuan ; Po-Chuan Lin ; Jhing-Fa Wang ; Nai-Sheng Shih

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    22
  • Issue
    8
  • fYear
    2014
  • fDate
    Aug. 2014
  • Firstpage
    1791
  • Lastpage
    1802
  • Abstract
    Sequential minimal optimization (SMO) and Karush-Kuhn-Tucker condition are often used to solve learning problems in support vector machines. However, during hardware implementation of the SMO algorithm, enhancing chip performance without excessively increasing chip area is often a crucial issue. The solution proposed in this paper is a novel reconfigurable and efficient chip design with SMO-based training accelerator (REC-STA). Two novel methods used in the proposed REC-STA are trimode coarse-grained reconfigurable architecture (TCRA) and triple finite-state-machine with dynamic scheduling The first method modifies the baseline SMO design by developing trimode reconfigurable architectures with parallel and pipeline computing capabilities. The second method provides a schedule for efficient reconfiguration of the TCRA. Use of these methods can remove kernel cache design. For chip manufacturing, the implementation of the REC-STA is synthesized, placed, and routed using the TSMC 0.18-μm technology library. The core size is 2.94 mm × 2.94 mm and the power consumption is 77.3 mW. Compared with the baseline design, the FPGA simulation results show that the proposed architecture requires 50% less memory and 31% fewer gate counts but provides a 16-fold improvement in training performance. The experimental results confirm the efficacy of the proposed architecture and methods.
  • Keywords
    field programmable gate arrays; finite state machines; logic design; optimisation; reconfigurable architectures; sequential circuits; support vector machines; FPGA simulation; Karush-Kuhn-Tucker condition; REC-STA; SMO-based training accelerator; TCRA; TSMC technology library; dynamic scheduling; finite-state-machine; hardware implementation; kernel cache design; learning problems; pipeline computing capabilities; power 77.3 mW; sequential minimal optimization; size 0.18 mum; size 2.94 mm; support vector machines; trimode coarse-grained reconfigurable architecture; trimode reconfigurable architectures; Arrays; Complexity theory; Hardware; Indexes; Support vector machines; Training; Reconfigurable computing; VLSI; VLSI.; sequential minimal optimization (SMO); speaker recognition; support vector machine (SVM); trimode coarse-grained reconfigurable architecture (TCRA); triple finite-state-machine with dynamic scheduling (TFDS);
  • fLanguage
    English
  • Journal_Title
    Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-8210
  • Type

    jour

  • DOI
    10.1109/TVLSI.2013.2278706
  • Filename
    6595532