R.I.P.
๐ป
Ghosted
Computron: Serving Distributed Deep Learning Models with Model Parallel Swapping
June 24, 2023 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: .gitignore, .gitmodules, LICENSE, README.md, alpa_serve, computron, energonai, examples, experiments, junkyard, playground, pyproject.toml
Authors
Daniel Zou, Xinchen Jin, Xueyang Yu, Hao Zhang, James Demmel
arXiv ID
2306.13835
Category
cs.DC: Distributed Computing
Cross-listed
cs.LG
Citations
1
Venue
arXiv.org
Repository
https://github.com/dlzou/computron
โญ 5
Last Checked
3 months ago
Abstract
Many of the most performant deep learning models today in fields like language and image understanding are fine-tuned models that contain billions of parameters. In anticipation of workloads that involve serving many of such large models to handle different tasks, we develop Computron, a system that uses memory swapping to serve multiple distributed models on a shared GPU cluster. Computron implements a model parallel swapping design that takes advantage of the aggregate CPU-GPU link bandwidth of a cluster to speed up model parameter transfers. This design makes swapping large models feasible and can improve resource utilization. We demonstrate that Computron successfully parallelizes model swapping on multiple GPUs, and we test it on randomized workloads to show how it can tolerate real world variability factors like burstiness and skewed request rates. Computron's source code is available at https://github.com/dlzou/computron.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Distributed Computing
R.I.P.
๐ป
Ghosted
Reproducing GW150914: the first observation of gravitational waves from a binary black hole merger
R.I.P.
๐ป
Ghosted
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
R.I.P.
๐ป
Ghosted
Adaptive Federated Learning in Resource Constrained Edge Computing Systems
R.I.P.
๐ป
Ghosted
Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing
R.I.P.
๐ป
Ghosted