مرکز منطقه ای اطلاع رساني علوم و فناوري - Almost All Learning Machines are Singular

DocumentCode :

2821237

Title :

Almost All Learning Machines are Singular

Author :

Watanabe, Sumio

Author_Institution :

Precision & Intelligence Lab., Tokyo Inst. of Technol., Yokohama

fYear :

2007

fDate :

1-5 April 2007

Firstpage :

383

Lastpage :

388

Abstract :

A learning machine is called singular if its Fisher information matrix is singular. Almost all learning machines used in information processing are singular, for example, layered neural networks, normal mixtures, binomial mixtures, Bayes networks, hidden Markov models, Boltzmann machines, stochastic context-free grammars, and reduced rank regressions are singular. In singular learning machines, the likelihood function can not be approximated by any quadratic form of the parameter. Moreover, neither the distribution of the maximum likelihood estimator nor the Bayes a posteriori distribution converges to the normal distribution, even if the number of training samples tends to infinity. Therefore, the conventional statistical learning theory does not hold in singular learning machines. This paper establishes the new mathematical foundation for singular learning machines. We propose that, by using resolution of singularities, the likelihood function can be represented as the standard form, by which we can prove the asymptotic behavior of the generalization errors of the maximum likelihood method and the Bayes estimation. The result will be a base on which training algorithms of singular learning machines are devised and optimized

Keywords :

Bayes methods; learning (artificial intelligence); matrix algebra; maximum likelihood estimation; Bayes a posteriori distribution; Bayes estimation; Fisher information matrix; information processing; learning machines; maximum likelihood estimator; statistical learning; Computational intelligence; Gaussian distribution; Hidden Markov models; Information processing; Learning systems; Machine learning; Maximum likelihood estimation; Neural networks; Statistical learning; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Foundations of Computational Intelligence, 2007. FOCI 2007. IEEE Symposium on

Conference_Location :

Honolulu, HI

Print_ISBN :

1-4244-0703-6

Type :

conf

DOI :

10.1109/FOCI.2007.371500

Filename :

4233934

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2821237