مرکز منطقه ای اطلاع رساني علوم و فناوري - Rényi Divergence Based Generalization for Learning of Classification Restricted Boltzmann Machines

DocumentCode :

243646

Title :

Rényi Divergence Based Generalization for Learning of Classification Restricted Boltzmann Machines

Author :

Qian Yu ; Yuexian Hou ; Xiaozhao Zhao ; Guochen Cheng

Author_Institution :

Sch. of Comput. Software, Tianjin Univ., Tianjin, China

fYear :

2014

fDate :

14-14 Dec. 2014

Firstpage :

692

Lastpage :

697

Abstract :

As a derivative of Restricted Boltzmann Machine (RBM), classification RBM (Class RBM) is proved to be an effective classifier with a probabilistic interpretation. Several elegant learning methods/models related to Class RBM have been proposed. This paper proposes and analyzes a Rényi divergence based generalization for discriminative learning objective of Class RBM. Specifically, we extend the Conditional Log Likelihood (CLL) objective to a general learning criterion. We prove that, some existing popular training methods can be derived from this generalization, via adjusting the parameters to specific values. Intuitively, the regularization with different settings of parameters constrain the learned RBM distribution in different ways, and the parameter setting that provide a suitable distribution constraints for a particular sample set leads to the optimal performance. Moreover, we show that this generalized criterion actually extends the CLL objective with a Rényi divergence-based regularization. The uniform distribution used in this divergence-based regularization can be replaced by some sample-based distribution. This modification is applicable to any specific case of the general objective, and we call the appended loss as general margin. The proposed generalization enables an effective model selection procedure and experiments on human face recognition and document classification achieved significant performance improvement over the existing learning methods. It is also shown empirically that general margin loss is able to stabilize the parameter sensitivity and further improve the performance of the classifiers.

Keywords :

Boltzmann machines; document handling; face recognition; generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; statistical distributions; CLL objective; Class RBM; Rényi divergence based generalization; Rényi divergence-based regularization; classification RBM; classification restricted Boltzmann machines; conditional log likelihood objective; discriminative learning objective; distribution constraint; document classification; general learning criterion; human face recognition; learning method; learning model; parameter sensitivity; probabilistic interpretation; sample-based distribution; training method; Databases; Educational institutions; Error analysis; Face recognition; Learning systems; Linear programming; Training; Classification RBM; Discriminative Learning; Rényi Divergence;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Mining Workshop (ICDMW), 2014 IEEE International Conference on

Conference_Location :

Shenzhen

Print_ISBN :

978-1-4799-4275-6

Type :

conf

DOI :

10.1109/ICDMW.2014.17

Filename :

7022663

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=243646