An Empirical Comparison of Neural Architectures for Reinforcement Learning in Partially Observable Environments

December 17, 2015 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Denis Steckelmacher, Peter Vrancx arXiv ID 1512.05509 Category cs.NE: Neural & Evolutionary Cross-listed cs.AI, cs.LG Citations 4 Venue arXiv.org Last Checked 4 months ago

Abstract

This paper explores the performance of fitted neural Q iteration for reinforcement learning in several partially observable environments, using three recurrent neural network architectures: Long Short-Term Memory, Gated Recurrent Unit and MUT1, a recurrent neural architecture evolved from a pool of several thousands candidate architectures. A variant of fitted Q iteration, based on Advantage values instead of Q values, is also explored. The results show that GRU performs significantly better than LSTM and MUT1 for most of the problems considered, requiring less training episodes and less CPU time before learning a very good policy. Advantage learning also tends to produce better results.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Neural & Evolutionary

🔮 🔮 The Ethereal

LSTM: A Search Space Odyssey

Klaus Greff, Rupesh Kumar Srivastava, ... (+3 more)

cs.NE 🏛 IEEE TNNLS 📚 6.0K cites 11 years ago

R.I.P. 👻 Ghosted

Deep Learning using Rectified Linear Units (ReLU)

Abien Fred Agarap

cs.NE 🏛 arXiv 📚 3.8K cites 8 years ago

R.I.P. 👻 Ghosted

Generative Adversarial Text to Image Synthesis

Scott Reed, Zeynep Akata, ... (+4 more)

cs.NE 🏛 ICML 📚 3.4K cites 10 years ago

R.I.P. 👻 Ghosted

Regularized Evolution for Image Classifier Architecture Search

Esteban Real, Alok Aggarwal, ... (+2 more)

cs.NE 🏛 AAAI 📚 3.2K cites 8 years ago

R.I.P. 👻 Ghosted

Temporal Ensembling for Semi-Supervised Learning

Samuli Laine, Timo Aila

cs.NE 🏛 ICLR 📚 2.8K cites 9 years ago

🌅 🌅 Old Age

Learning Structured Sparsity in Deep Neural Networks

Wei Wen, Chunpeng Wu, ... (+3 more)

cs.NE 🏛 NeurIPS 📚 2.5K cites 9 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago