R.I.P.
๐ป
Ghosted
RTP: Rethinking Tensor Parallelism with Memory Deduplication
November 02, 2023 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: LICENSE, README.md, benchmarks, docs, requirements.txt, rtp, rtp_mnist.py, setup.py, test.py, test_net.py, test_transformer.py, tests, tutorials
Authors
Cheng Luo, Tianle Zhong, Geoffrey Fox
arXiv ID
2311.01635
Category
cs.DC: Distributed Computing
Cross-listed
cs.AI,
cs.NI
Citations
4
Venue
arXiv.org
Repository
https://github.com/wdlctc/rtp
โญ 11
Last Checked
3 months ago
Abstract
In the evolving landscape of neural network models, one prominent challenge stand out: the significant memory overheads associated with training expansive models. Addressing this challenge, this study delves deep into the Rotated Tensor Parallelism (RTP). RTP is an innovative approach that strategically focuses on memory deduplication in distributed training environments. It boasts of unique features like a customized communication primitive and the Flyweight Pattern initialization. Furthermore, RTP ensures a seamless overlap between partition computation and partition weight communication, optimizing the training process. Our empirical evaluations underscore RTP's efficiency, revealing that its memory consumption during distributed system training is remarkably close to the optimal - distributing the memory overhead of a single machine equitably among multiple machines. The experimental results demonstrate that RTP is capable of achieving comparable performance to Distributed Data Parallel while providing support for significantly larger models with near-linear scalability in terms of memory. Code of RTP is available at https://github.com/wdlctc/rtp.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Distributed Computing
R.I.P.
๐ป
Ghosted
Reproducing GW150914: the first observation of gravitational waves from a binary black hole merger
R.I.P.
๐ป
Ghosted
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
R.I.P.
๐ป
Ghosted
Adaptive Federated Learning in Resource Constrained Edge Computing Systems
R.I.P.
๐ป
Ghosted
Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing
R.I.P.
๐ป
Ghosted