AVX-512 extension to OpenQCD 1.6
June 15, 2018 Β· Declared Dead Β· + Add venue
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ed Bennett, Mark Dawson, Michele Mesiti, Jarno Rantaharju
arXiv ID
1806.06043
Category
hep-lat
Cross-listed
cs.DC
Citations
2
Last Checked
3 months ago
Abstract
We publish an extension of openQCD-1.6 with AVX-512 vector instructions using Intel intrinsics. Recent Intel processors support extended instruction sets with operations on 512-bit wide vectors, increasing both the capacity for floating point operations and register memory. Optimal use of the new capabilities requires reorganising data and floating point operations into these wider vector units. We report on the implementation and performance of the AVX-512 OpenQCD extension on clusters using Intel Knights Landing and Xeon Scalable (Skylake) CPUs. In complete HMC trajectories with physically relevant parameters we observe a performance increase of 5% to 10%.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β hep-lat
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Lattice gauge equivariant convolutional neural networks
R.I.P.
π»
Ghosted
Aspects of scaling and scalability for flow-based sampling of lattice QCD
R.I.P.
π»
Ghosted
Gauge Equivariant Neural Networks for 2+1D U(1) Gauge Theory Simulations in Hamiltonian Formulation
R.I.P.
π»
Ghosted
Job Management and Task Bundling
R.I.P.
π»
Ghosted
Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted