A survey of deep learning optimizers -- first and second order methods

November 28, 2022 ยท The Cartographer ยท ๐Ÿ› arXiv.org

๐Ÿ“š THE CARTOGRAPHER: The Cartographer
Survey/review paper โ€” maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A survey of deep learning optimizers -- first and second order methods"

Evidence collected by the PWNC Scanner

Authors Rohan Kashyap arXiv ID 2211.15596 Category cs.LG: Machine Learning Cross-listed cs.CV, math.OC Citations 12 Venue arXiv.org Last Checked 3 days ago
Abstract
Deep Learning optimization involves minimizing a high-dimensional loss function in the weight space which is often perceived as difficult due to its inherent difficulties such as saddle points, local minima, ill-conditioning of the Hessian and limited compute resources. In this paper, we provide a comprehensive review of $14$ standard optimization methods successfully used in deep learning research and a theoretical assessment of the difficulties in numerical optimization from the optimization literature.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning