• DocumentCode
    1315096
  • Title

    A mathematical solution to a network construction problem

  • Author

    Takahashi, Yoshikane

  • Author_Institution
    NTT Cyber Space Labs., Yokosuka, Japan
  • Volume
    47
  • Issue
    2
  • fYear
    2000
  • fDate
    2/1/2000 12:00:00 AM
  • Firstpage
    166
  • Lastpage
    184
  • Abstract
    One of the major open issues in neural networks includes a network construction problem (NCP) to find a procedure, polynomial time if possible, that produces a minimal structure (minimum size, threshold and weight) of a multilayer threshold feed-forward network where its output must not exceed any given admissible distortion from a sample, The NCP includes a subproblem, a network training problem (NTP), where the size is prespecified. Approximate versions of the NCP/NTP have been solved with iterative algorithms for the network construction/training. This paper provides a mathematically rigorous solution to the NCP using rate distortion theory from information theory and linear algebra. This solution is used to develop a mathematical procedure that specifically constructs a minimal structure from the sample. The procedure attains the exact minimum though its computational time is, at worst, nonpolynomial. The paper also constructs a polynomial-time shortcut for approximate minimum sizes, which is a promising alternative to current algorithms
  • Keywords
    feedforward neural nets; iterative methods; learning (artificial intelligence); multilayer perceptrons; rate distortion theory; admissible distortion; computational time; iterative algorithms; minimal structure; multilayer threshold feed-forward network; network construction problem; network training problem; neural networks; polynomial time; rate distortion theory; Feedforward neural networks; Feedforward systems; Information theory; Iterative algorithms; Linear algebra; Multi-layer neural network; Neural networks; Pattern classification; Polynomials; Rate distortion theory;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems I: Fundamental Theory and Applications, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7122
  • Type

    jour

  • DOI
    10.1109/81.828570
  • Filename
    828570