R.I.P.
π»
Ghosted
RLOR: A Flexible Framework of Deep Reinforcement Learning for Operation Research
March 23, 2023 Β· Entered Twilight Β· π arXiv.org
Repo contents: .pre-commit-config.yaml, CITATION.cff, LICENSE, README.md, demo, environment.yml, envs, models, ppo.py, ppo_or.py, runs, wrappers
Authors
Ching Pui Wan, Tung Li, Jason Min Wang
arXiv ID
2303.13117
Category
math.OC: Optimization & Control
Cross-listed
cs.LG,
cs.NE
Citations
4
Venue
arXiv.org
Repository
https://github.com/cpwan/RLOR
β 126
Last Checked
3 months ago
Abstract
Reinforcement learning has been applied in operation research and has shown promise in solving large combinatorial optimization problems. However, existing works focus on developing neural network architectures for certain problems. These works lack the flexibility to incorporate recent advances in reinforcement learning, as well as the flexibility of customizing model architectures for operation research problems. In this work, we analyze the end-to-end autoregressive models for vehicle routing problems and show that these models can benefit from the recent advances in reinforcement learning with a careful re-implementation of the model architecture. In particular, we re-implemented the Attention Model and trained it with Proximal Policy Optimization (PPO) in CleanRL, showing at least 8 times speed up in training time. We hereby introduce RLOR, a flexible framework for Deep Reinforcement Learning for Operation Research. We believe that a flexible framework is key to developing deep reinforcement learning models for operation research problems. The code of our work is publicly available at https://github.com/cpwan/RLOR.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Optimization & Control
R.I.P.
π»
Ghosted
Local SGD Converges Fast and Communicates Little
R.I.P.
π»
Ghosted
On Lazy Training in Differentiable Programming
π
π
The Cartographer
A Review on Bilevel Optimization: From Classical to Evolutionary Approaches and Applications
R.I.P.
π»
Ghosted
Learned Primal-dual Reconstruction
R.I.P.
π»
Ghosted