Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines

June 20, 2017 Β· Declared Dead Β· πŸ› arXiv.org

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Philip S. Thomas, Emma Brunskill arXiv ID 1706.06643 Category cs.AI: Artificial Intelligence Cross-listed cs.LG Citations 54 Venue arXiv.org Last Checked 4 months ago
Abstract
We show how an action-dependent baseline can be used by the policy gradient theorem using function approximation, originally presented with action-independent baselines by (Sutton et al. 2000).
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Artificial Intelligence

Died the same way β€” πŸ‘» Ghosted