A Tutorial on Neural Networks and Gradient-free Training
November 26, 2022 ยท The Cartographer ยท ๐ arXiv.org
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Tutorial on Neural Networks and Gradient-free Training"
Evidence collected by the PWNC Scanner
Authors
Turibius Rozario, Arjun Trivedi, Ankit Goel
arXiv ID
2211.17217
Category
eess.SY: Systems & Control (EE)
Cross-listed
cs.LG
Citations
1
Venue
arXiv.org
Last Checked
4 days ago
Abstract
This paper presents a compact, matrix-based representation of neural networks in a self-contained tutorial fashion. Specifically, we develop neural networks as a composition of several vector-valued functions. Although neural networks are well-understood pictorially in terms of interconnected neurons, neural networks are mathematical nonlinear functions constructed by composing several vector-valued functions. Using basic results from linear algebra, we represent a neural network as an alternating sequence of linear maps and scalar nonlinear functions, also known as activation functions. The training of neural networks requires the minimization of a cost function, which in turn requires the computation of a gradient. Using basic multivariable calculus results, the cost gradient is also shown to be a function composed of a sequence of linear maps and nonlinear functions. In addition to the analytical gradient computation, we consider two gradient-free training methods and compare the three training methods in terms of convergence rate and prediction accuracy.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Systems & Control (EE)
๐
๐
The Cartographer
๐
๐
The Cartographer
Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey
๐
๐
The Cartographer
Wireless Network Design for Control Systems: A Survey
R.I.P.
๐ป
Ghosted
Learning-based Model Predictive Control for Safe Exploration
R.I.P.
๐ป
Ghosted
Safety-Critical Model Predictive Control with Discrete-Time Control Barrier Function
R.I.P.
๐ป
Ghosted