Beyond Gradient Descent for Regularized Segmentation Losses

September 07, 2018 · Entered Twilight · 🏛 arXiv.org

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, deeplab

Authors Dmitrii Marin, Meng Tang, Ismail Ben Ayed, Yuri Boykov arXiv ID 1809.02322 Category cs.LG: Machine Learning Cross-listed stat.ML Citations 1 Venue arXiv.org Repository https://github.com/dmitrii-marin/adm-seg ⭐ 11 Last Checked 4 months ago

Abstract

The simplicity of gradient descent (GD) made it the default method for training ever-deeper and complex neural networks. Both loss functions and architectures are often explicitly tuned to be amenable to this basic local optimization. In the context of weakly-supervised CNN segmentation, we demonstrate a well-motivated loss function where an alternative optimizer (ADM) achieves the state-of-the-art while GD performs poorly. Interestingly, GD obtains its best result for a "smoother" tuning of the loss function. The results are consistent across different network architectures. Our loss is motivated by well-understood MRF/CRF regularization models in "shallow" segmentation and their known global solvers. Our work suggests that network design/training should pay more attention to optimization methods.