Expressing linear equality constraints in feedforward neural networks

November 08, 2022 · Entered Twilight · 🏛 arXiv.org

Repo contents: CS_iter_NN_loss_Lag_param_perm.pdf, LICENSE, Lagrange_results_perm.pickle, README.md, requirements.txt, xor_newcustom_linesearch_multipleruns_lag_init_arXiv_v2.py

Authors Anand Rangarajan, Pan He, Jaemoon Lee, Tania Banerjee, Sanjay Ranka arXiv ID 2211.04395 Category cs.LG: Machine Learning Cross-listed cs.CV, math.OC Citations 5 Venue arXiv.org Repository https://github.com/anandrajan0/smartalec ⭐ 2 Last Checked 3 months ago

Abstract

We seek to impose linear, equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome this, we introduce a new saddle-point Lagrangian with auxiliary predictor variables on which constraints are imposed. Elimination of the auxiliary variables leads to a dual minimization problem on the Lagrange multipliers introduced to satisfy the linear constraints. This minimization problem is combined with the standard learning problem on the weight matrices. From this theoretical line of development, we obtain the surprising interpretation of Lagrange parameters as additional, penultimate layer hidden units with fixed weights stemming from the constraints. Consequently, standard minimization approaches can be used despite the inclusion of Lagrange parameters -- a very satisfying, albeit unexpected, discovery. Examples ranging from multi-label classification to constrained autoencoders are envisaged in the future. The code has been made available at https://github.com/anandrajan0/smartalec