Exploring Memory Persistency Models for GPUs

April 24, 2019 · Declared Dead · 🏛 International Conference on Parallel Architectures and Compilation Techniques

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Zhen Lin, Mohammad Alshboul, Yan Solihin, Huiyang Zhou arXiv ID 1904.12661 Category cs.DC: Distributed Computing Citations 12 Venue International Conference on Parallel Architectures and Compilation Techniques Last Checked 4 months ago

Abstract

Given its high integration density, high speed, byte addressability, and low standby power, non-volatile or persistent memory is expected to supplement/replace DRAM as main memory. Through persistency programming models (which define durability ordering of stores) and durable transaction constructs, the programmer can provide recoverable data structure (RDS) which allows programs to recover to a consistent state after a failure. While persistency models have been well studied for CPUs, they have been neglected for graphics processing units (GPUs). Considering the importance of GPUs as a dominant accelerator for high performance computing, we investigate persistency models for GPUs. GPU applications exhibit substantial differences with CPUs applications, hence in this paper we adapt, re-architect, and optimize CPU persistency models for GPUs. We design a pragma-based compiler scheme to express persistency models for GPUs. We identify that the thread hierarchy in GPUs offers intuitive scopes to form epochs and durable transactions. We find that undo logging produces significant performance overheads. We propose to use idempotency analysis to reduce both logging frequency and the size of logs. Through both real-system and simulation evaluations, we show low overheads of our proposed architecture support.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Distributed Computing

R.I.P. 👻 Ghosted

Hyperledger Fabric: A Distributed Operating System for Permissioned Blockchains

Elli Androulaki, Artem Barger, ... (+19 more)

cs.DC 🏛 European Conference on Computer Systems 📚 4.0K cites 8 years ago

R.I.P. 👻 Ghosted

Reproducing GW150914: the first observation of gravitational waves from a binary black hole merger

Duncan A. Brown, Karan Vahi, ... (+3 more)

cs.DC 🏛 Computing in science & engineering (Print) 📚 2.3K cites 5 years ago

R.I.P. 👻 Ghosted

MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems

Tianqi Chen, Mu Li, ... (+8 more)

cs.DC 🏛 arXiv 📚 2.3K cites 10 years ago

R.I.P. 👻 Ghosted

Adaptive Federated Learning in Resource Constrained Edge Computing Systems

Shiqiang Wang, Tiffany Tuor, ... (+5 more)

cs.DC 🏛 IEEE JSAC 📚 2.0K cites 8 years ago

R.I.P. 👻 Ghosted

Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing

Zhi Zhou, Xu Chen, ... (+4 more)

cs.DC 🏛 Proc. IEEE 📚 1.7K cites 7 years ago

R.I.P. 👻 Ghosted

iFogSim: A Toolkit for Modeling and Simulation of Resource Management Techniques in Internet of Things, Edge and Fog Computing Environments

Harshit Gupta, Amir Vahid Dastjerdi, ... (+2 more)

cs.DC 🏛 Software, Practice & Experience 📚 1.5K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago