Expressing linear equality constraints in feedforward neural networks

November 08, 2022 ยท Entered Twilight ยท ๐Ÿ› arXiv.org

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: CS_iter_NN_loss_Lag_param_perm.pdf, LICENSE, Lagrange_results_perm.pickle, README.md, requirements.txt, xor_newcustom_linesearch_multipleruns_lag_init_arXiv_v2.py

Authors Anand Rangarajan, Pan He, Jaemoon Lee, Tania Banerjee, Sanjay Ranka arXiv ID 2211.04395 Category cs.LG: Machine Learning Cross-listed cs.CV, math.OC Citations 5 Venue arXiv.org Repository https://github.com/anandrajan0/smartalec โญ 2 Last Checked 3 months ago
Abstract
We seek to impose linear, equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome this, we introduce a new saddle-point Lagrangian with auxiliary predictor variables on which constraints are imposed. Elimination of the auxiliary variables leads to a dual minimization problem on the Lagrange multipliers introduced to satisfy the linear constraints. This minimization problem is combined with the standard learning problem on the weight matrices. From this theoretical line of development, we obtain the surprising interpretation of Lagrange parameters as additional, penultimate layer hidden units with fixed weights stemming from the constraints. Consequently, standard minimization approaches can be used despite the inclusion of Lagrange parameters -- a very satisfying, albeit unexpected, discovery. Examples ranging from multi-label classification to constrained autoencoders are envisaged in the future. The code has been made available at https://github.com/anandrajan0/smartalec
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning