The Ethereal
Expressing linear equality constraints in feedforward neural networks
November 08, 2022 · Entered Twilight · arXiv.org
Repo contents: CS_iter_NN_loss_Lag_param_perm.pdf, LICENSE, Lagrange_results_perm.pickle, README.md, requirements.txt, xor_newcustom_linesearch_multipleruns_lag_init_arXiv_v2.py
Authors
Anand Rangarajan, Pan He, Jaemoon Lee, Tania Banerjee, Sanjay Ranka
arXiv ID
2211.04395
Category
cs.LG: Machine Learning
Cross-listed
cs.CV, math.OC
Citations
5
Venue
arXiv.org
Repository
https://github.com/anandrajan0/smartalec
⭐ 2
Last Checked
3 months ago
Abstract
We seek to impose linear, equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome this, we introduce a new saddle-point Lagrangian with auxiliary predictor variables on which constraints are imposed. Elimination of the auxiliary variables leads to a dual minimization problem on the Lagrange multipliers introduced to satisfy the linear constraints. This minimization problem is combined with the standard learning problem on the weight matrices. From this theoretical line of development, we obtain the surprising interpretation of Lagrange parameters as additional, penultimate layer hidden units with fixed weights stemming from the constraints. Consequently, standard minimization approaches can be used despite the inclusion of Lagrange parameters -- a very satisfying, albeit unexpected, discovery. Examples ranging from multi-label classification to constrained autoencoders are envisaged in the future. The code has been made available at https://github.com/anandrajan0/smartalec
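The abstract's reading of Lagrange parameters as extra penultimate-layer units with fixed weights can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: sigmoid output units, a single constraint sum_i a_i * y_i = b, bisection as the solver for the multiplier, and the hypothetical names `constrained_outputs`, `u`, `a`, `b`, and `lam`. The multiplier `lam` enters each output's pre-activation through the fixed weight `a_i`, and it is chosen so that the constraint holds.

```python
import math


def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def constrained_outputs(u, a, b, tol=1e-12, max_iter=200):
    """Return (y, lam) with y_i = sigmoid(u_i + a_i * lam) and sum_i a_i * y_i = b.

    u   : pre-activations produced by the learned weights (assumed given)
    a   : coefficients of the single linear constraint; they act as fixed weights
    b   : right-hand side of the constraint
    lam : scalar Lagrange parameter, treated as one extra hidden unit
    """
    def residual(lam):
        # Constraint violation; it is non-decreasing in lam because its
        # derivative is sum_i a_i^2 * s_i * (1 - s_i) >= 0.
        return sum(ai * sigmoid(ui + ai * lam) for ui, ai in zip(u, a)) - b

    # Sigmoid outputs lie in (0, 1), so b must lie strictly between these bounds.
    lower = sum(ai for ai in a if ai < 0)
    upper = sum(ai for ai in a if ai > 0)
    if not lower < b < upper:
        raise ValueError("b is not attainable with sigmoid outputs")

    # Expand a bracket until the residual changes sign, then bisect.
    lo, hi = -1.0, 1.0
    while residual(lo) > 0:
        lo *= 2.0
    while residual(hi) < 0:
        hi *= 2.0
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0:
            hi = mid
        else:
            lo = mid
        if hi - lo < tol:
            break

    lam = 0.5 * (lo + hi)
    y = [sigmoid(ui + ai * lam) for ui, ai in zip(u, a)]
    return y, lam


# Usage: force three sigmoid outputs to sum to one.
u = [0.3, -1.2, 2.0]
a = [1.0, 1.0, 1.0]
y, lam = constrained_outputs(u, a, b=1.0)
print(y, lam, sum(y))
```

Bisection is used here only because the residual is monotone in a single multiplier. With several constraints the multiplier becomes a vector and a different solver would be needed.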
Similar Papers
In the same crypt: Machine Learning
The Ethereal
Continuous control with deep reinforcement learning
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
The Ethereal