Improving compiler support for SIMD offload using Arm Streaming SVE

June 02, 2025 · Declared Dead · 🏛 Information Security Conference

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Mohamed Husain Noor Mohamed, Adarsh Patil, Latchesar Ionkov, Eric Van Hensbergen
arXiv ID: 2506.02233
Category: cs.PL: Programming Languages
Citations: 0
Venue: Information Security Conference
Last Checked: 4 months ago

Abstract

The wider adoption of tightly coupled core-adjacent accelerators, such as Arm Scalable Matrix Extension (SME), hinges on lowering software programming complexity. In this paper, we focus on enabling the use of the SME architecture in Streaming Scalable Vector Extension (SSVE) mode for workloads written in C/C++. While current compilers optimize loops for all types of SIMD instructions, these techniques primarily target vector units within the core and falter when applied to disaggregated, core-adjacent SIMD accelerators. Our goal is to enable the compiler to automatically generate code for such accelerators only when profitable. To this end, we investigate a path towards performant, precise, and repeatable computation offloading through two compiler ecosystems. We revisit LLVM compiler passes, MLIR transforms, and their associated cost models and heuristics. We hope that these insights can provide directions for evolving compiler capabilities towards automatic code generation for this next-generation vector processing paradigm.
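The phrase "only when profitable" in the abstract can be illustrated with a toy cost model. The sketch below is not the paper's method: it assumes a loop's cost is linear in its trip count and that offloading pays a fixed cost for entering and leaving streaming mode. All names (`LoopCost`, `should_offload`, `break_even_trip_count`) and all cycle counts are hypothetical placeholders, not measured values.

```python
from dataclasses import dataclass
from math import floor
from typing import Optional


@dataclass(frozen=True)
class LoopCost:
    """Hypothetical per-loop cost estimates, in cycles (illustrative only)."""
    trip_count: int               # number of loop iterations
    core_cycles_per_iter: float   # estimated cost per iteration on in-core SIMD
    accel_cycles_per_iter: float  # estimated cost per iteration on the accelerator
    mode_switch_cycles: float     # assumed fixed cost to enter and exit streaming mode


def core_cycles(c: LoopCost) -> float:
    """Estimated total cycles when the loop stays on the in-core vector unit."""
    return c.trip_count * c.core_cycles_per_iter


def offload_cycles(c: LoopCost) -> float:
    """Estimated total cycles when offloaded, including the fixed switch cost."""
    return c.mode_switch_cycles + c.trip_count * c.accel_cycles_per_iter


def should_offload(c: LoopCost) -> bool:
    """Offload only if the estimate is strictly better than staying in-core."""
    return offload_cycles(c) < core_cycles(c)


def break_even_trip_count(c: LoopCost) -> Optional[int]:
    """Smallest trip count at which offloading wins, or None if it never does."""
    gain_per_iter = c.core_cycles_per_iter - c.accel_cycles_per_iter
    if gain_per_iter <= 0:
        return None  # the accelerator is not faster per iteration
    # Need: mode_switch + n * accel < n * core, i.e. n > mode_switch / gain.
    return floor(c.mode_switch_cycles / gain_per_iter) + 1


# Usage with made-up numbers: a short loop stays in-core, a long one is offloaded.
short_loop = LoopCost(trip_count=50, core_cycles_per_iter=4.0,
                      accel_cycles_per_iter=1.0, mode_switch_cycles=300.0)
long_loop = LoopCost(trip_count=10_000, core_cycles_per_iter=4.0,
                     accel_cycles_per_iter=1.0, mode_switch_cycles=300.0)
decisions = (should_offload(short_loop), should_offload(long_loop))
```

A real compiler cost model would need to account for far more than this (memory traffic, vector length, and how often the mode switch can be hoisted out of hot paths), so the linear model here only shows the shape of the decision.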

📜 Similar Papers

In the same crypt — Programming Languages

R.I.P. 👻 Ghosted

Ascertaining Uncertainty for Efficient Exact Cache Analysis

Valentin Touzeau, Claire Maïza, ... (+2 more)

cs.PL 🏛 CAV 📚 816 cites 8 years ago

R.I.P. 👻 Ghosted

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

Nicolas Vasilache, Oleksandr Zinenko, ... (+7 more)

cs.PL 🏛 arXiv 📚 472 cites 8 years ago

R.I.P. 👻 Ghosted

Glow: Graph Lowering Compiler Techniques for Neural Networks

Nadav Rotem, Jordan Fix, ... (+16 more)

cs.PL 🏛 arXiv 📚 318 cites 8 years ago

R.I.P. 👻 Ghosted

Learnable Programming: Blocks and Beyond

David Bau, Jeff Gray, ... (+3 more)

cs.PL 🏛 CACM 📚 298 cites 9 years ago

R.I.P. 👻 Ghosted

Scenic: A Language for Scenario Specification and Scene Generation

Daniel J. Fremont, Tommaso Dreossi, ... (+4 more)

cs.PL 🏛 ACM-SIGPLAN Symposium on Programming Language Design and Implementation 📚 297 cites 7 years ago

R.I.P. 👻 Ghosted

Vandal: A Scalable Security Analysis Framework for Smart Contracts

Lexi Brent, Anton Jurisevic, ... (+6 more)

cs.PL 🏛 arXiv 📚 296 cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago