  A Framework for the Development of Globally Convergent Adaptive Learning Rate Algorithms

by George D. Magoulas, Vassilis P. Plagianakos, George S. Androulakis, Michael N. Vrahatis, Department Of Informatics
http://www.math.upatras.gr/~vpp/pdf/a-2.pdf

Abstract:

We propose a framework for developing globally convergent batch training algorithms with an adaptive learning rate. The framework provides conditions under which global convergence is guaranteed for such algorithms; to this end, the learning rate is tuned appropriately along the given descent direction. By imposing conditions on the search direction and the corresponding step length, the framework can also guarantee global convergence for training algorithms that use a different learning rate for each weight. Simulation results illustrate the effectiveness of the approach on various training algorithms.

Keywords: global convergence, learning rate adaptation, batch training algorithms, steepest descent, feedforward neural networks.
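The abstract does not spell out the convergence conditions themselves. As a rough illustration of what tuning the learning rate along a descent direction can look like, the sketch below runs batch steepest descent and shrinks a trial learning rate until a sufficient-decrease (Armijo-type) test holds. The choice of that test, the function name `armijo_gradient_descent`, and all parameter names and defaults are assumptions made for this example; they are not taken from the paper.

```python
def armijo_gradient_descent(f, grad, w, lr0=1.0, c=1e-4, shrink=0.5,
                            tol=1e-8, max_iter=1000, min_lr=1e-16):
    """Steepest descent with a backtracked learning rate (illustrative sketch).

    f    : error function, maps a list of weights to a float
    grad : gradient of f, maps a list of weights to a list of floats
    w    : initial weights
    """
    w = list(w)
    for _ in range(max_iter):
        g = grad(w)
        g_norm_sq = sum(gi * gi for gi in g)
        if g_norm_sq ** 0.5 < tol:
            break  # gradient is (numerically) zero: stationary point reached

        fw = f(w)
        lr = lr0
        trial = [wi - lr * gi for wi, gi in zip(w, g)]
        # Assumed acceptance test (Armijo-type sufficient decrease):
        #   f(w - lr * g) <= f(w) - c * lr * ||g||^2
        # Shrink lr until the test passes or lr falls below min_lr.
        while f(trial) > fw - c * lr * g_norm_sq and lr > min_lr:
            lr *= shrink
            trial = [wi - lr * gi for wi, gi in zip(w, g)]

        if f(trial) >= fw:
            break  # no decrease found even with a tiny learning rate
        w = trial
    return w
```

Here "globally convergent" is meant in the optimization sense: the iterates approach a stationary point of the error function from any starting weights, which is not the same as reaching a global minimum. A per-weight variant would replace the scalar `lr` with one learning rate per component, subject to conditions on the resulting search direction that this sketch does not attempt to reproduce.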
