
Training Neural Network With Second-Order Algorithm

Posted on: 2019-10-07
Degree: Master
Type: Thesis
Country: China
Candidate: W Huang
Full Text: PDF
GTID: 2428330545498033
Subject: Probability theory and mathematical statistics
Abstract/Summary:
Since Hinton et al. proposed the back-propagation (BP) theory, the stochastic gradient descent method has become a common method for training neural networks such as fully connected neural networks, long short-term memory neural networks, and convolutional neural networks. Although the stochastic gradient descent method, which converges Q-linearly, can reach suitable parameter values, it often needs many iterations to do so. For the training algorithm to converge within a small number of iterations, a faster-converging algorithm needs to be adopted.

In this paper, we first introduce the basic structure of fully connected neural networks and the approximation theory of neural networks. This provides a solid theoretical foundation for the subsequent study of neural networks.

In Chapter 2, we introduce the steepest gradient descent algorithm, the Newton algorithm, the conjugate gradient algorithm, and the SESOP algorithm, and compare their convergence speeds.

In Chapter 3, analogous to the local gradient in the BP algorithm, the concepts of the second-order local Hessian matrix and second-order local partial derivatives are proposed for the specific structure of fully connected neural networks, and their back-propagation formulas are given. Finally, we propose the damped Gauss-Newton algorithm and the SESOP algorithm to solve for the parameters of fully connected neural networks.

In Chapter 4, we use the open-source MNIST dataset to verify that the two algorithms proposed in this paper are superior to the BP algorithm in convergence speed.

In Chapter 5, we discuss the research prospects of the two proposed algorithms.
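To make the second-order update concrete, the following is a minimal sketch of the generic damped Gauss-Newton step, theta <- theta - (J^T J + mu I)^{-1} J^T r, applied to a toy nonlinear least-squares problem. The exponential model, the fixed damping factor mu, and the helper functions are illustrative assumptions for this sketch and are not the thesis's actual neural-network implementation.

```python
import numpy as np

# Toy problem (assumed for illustration): fit y = a * exp(b * x) by
# nonlinear least squares with the damped Gauss-Newton update
#     theta <- theta - (J^T J + mu I)^{-1} J^T r,
# where r is the residual vector, J its Jacobian, and mu a damping factor.

rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 50)
y = 2.0 * np.exp(1.5 * x) + 0.05 * rng.standard_normal(x.size)

def residuals(theta):
    a, b = theta
    return a * np.exp(b * x) - y

def jacobian(theta):
    a, b = theta
    e = np.exp(b * x)
    # Columns: d r / d a and d r / d b.
    return np.column_stack([e, a * x * e])

theta = np.array([1.0, 1.0])   # initial guess
mu = 1e-2                      # damping factor (kept fixed in this sketch)

for _ in range(20):
    r = residuals(theta)
    J = jacobian(theta)
    step = np.linalg.solve(J.T @ J + mu * np.eye(2), J.T @ r)
    theta = theta - step

print(theta)  # should approach the true parameters (2.0, 1.5)
```

In the thesis setting, the residuals are the per-sample output errors of the fully connected network and the Jacobian is obtained through the second-order back-propagation formulas described in Chapter 3; only the general form of the update is shown here.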
Keywords/Search Tags: Neural networks, Newton-Raphson method, Damped Gauss-Newton, SESOP