Compressed Federated Reinforcement Learning with a Generative Model

March 26, 2024 · Declared Dead · 🏛 ECML/PKDD

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Ali Beikmohammadi, Sarit Khirirat, Sindri Magnússon arXiv ID 2404.10635 Category cs.DC: Distributed Computing Cross-listed cs.LG, cs.MA Citations 5 Venue ECML/PKDD Last Checked 4 months ago

Abstract

Reinforcement learning has recently gained unprecedented popularity, yet it still grapples with sample inefficiency. Addressing this challenge, federated reinforcement learning (FedRL) has emerged, wherein agents collaboratively learn a single policy by aggregating local estimations. However, this aggregation step incurs significant communication costs. In this paper, we propose CompFedRL, a communication-efficient FedRL approach incorporating both \textit{periodic aggregation} and (direct/error-feedback) compression mechanisms. Specifically, we consider compressed federated $Q$-learning with a generative model setup, where a central server learns an optimal $Q$-function by periodically aggregating compressed $Q$-estimates from local agents. For the first time, we characterize the impact of these two mechanisms (which have remained elusive) by providing a finite-time analysis of our algorithm, demonstrating strong convergence behaviors when utilizing either direct or error-feedback compression. Our bounds indicate improved solution accuracy concerning the number of agents and other federated hyperparameters while simultaneously reducing communication costs. To corroborate our theory, we also conduct in-depth numerical experiments to verify our findings, considering Top-$K$ and Sparsified-$K$ sparsification operators.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Distributed Computing

R.I.P. 👻 Ghosted

Hyperledger Fabric: A Distributed Operating System for Permissioned Blockchains

Elli Androulaki, Artem Barger, ... (+19 more)

cs.DC 🏛 European Conference on Computer Systems 📚 4.0K cites 8 years ago

R.I.P. 👻 Ghosted

Reproducing GW150914: the first observation of gravitational waves from a binary black hole merger

Duncan A. Brown, Karan Vahi, ... (+3 more)

cs.DC 🏛 Computing in science & engineering (Print) 📚 2.3K cites 5 years ago

R.I.P. 👻 Ghosted

MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems

Tianqi Chen, Mu Li, ... (+8 more)

cs.DC 🏛 arXiv 📚 2.3K cites 10 years ago

R.I.P. 👻 Ghosted

Adaptive Federated Learning in Resource Constrained Edge Computing Systems

Shiqiang Wang, Tiffany Tuor, ... (+5 more)

cs.DC 🏛 IEEE JSAC 📚 2.0K cites 8 years ago

R.I.P. 👻 Ghosted

Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing

Zhi Zhou, Xu Chen, ... (+4 more)

cs.DC 🏛 Proc. IEEE 📚 1.7K cites 7 years ago

R.I.P. 👻 Ghosted

iFogSim: A Toolkit for Modeling and Simulation of Resource Management Techniques in Internet of Things, Edge and Fog Computing Environments

Harshit Gupta, Amir Vahid Dastjerdi, ... (+2 more)

cs.DC 🏛 Software, Practice & Experience 📚 1.5K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago