DIGEST: Fast and Communication Efficient Decentralized Learning with Local Updates

July 14, 2023 · Declared Dead · 🏛 IEEE Transactions on Machine Learning in Communications and Networking

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Peyman Gholami, Hulya Seferoglu arXiv ID 2307.07652 Category cs.LG: Machine Learning Cross-listed cs.DC Citations 17 Venue IEEE Transactions on Machine Learning in Communications and Networking Last Checked 4 months ago

Abstract

Two widely considered decentralized learning algorithms are Gossip and random walk-based learning. Gossip algorithms (both synchronous and asynchronous versions) suffer from high communication cost, while random-walk based learning experiences increased convergence time. In this paper, we design a fast and communication-efficient asynchronous decentralized learning mechanism DIGEST by taking advantage of both Gossip and random-walk ideas, and focusing on stochastic gradient descent (SGD). DIGEST is an asynchronous decentralized algorithm building on local-SGD algorithms, which are originally designed for communication efficient centralized learning. We design both single-stream and multi-stream DIGEST, where the communication overhead may increase when the number of streams increases, and there is a convergence and communication overhead trade-off which can be leveraged. We analyze the convergence of single- and multi-stream DIGEST, and prove that both algorithms approach to the optimal solution asymptotically for both iid and non-iid data distributions. We evaluate the performance of single- and multi-stream DIGEST for logistic regression and a deep neural network ResNet20. The simulation results confirm that multi-stream DIGEST has nice convergence properties; i.e., its convergence time is better than or comparable to the baselines in iid setting, and outperforms the baselines in non-iid setting.