Efficient and Robust Jet Tagging at the LHC with Knowledge Distillation
November 23, 2023 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ryan Liu, Abhijith Gandrakota, Jennifer Ngadiuba, Maria Spiropulu, Jean-Roch Vlimant
arXiv ID
2311.14160
Category
hep-ex
Cross-listed
cs.LG
Citations
3
Venue
arXiv.org
Last Checked
3 months ago
Abstract
The challenging environment of real-time data processing systems at the Large Hadron Collider (LHC) strictly limits the computational complexity of algorithms that can be deployed. For deep learning models, this implies that only models with low computational complexity that have weak inductive bias are feasible. To address this issue, we utilize knowledge distillation to leverage both the performance of large models and the reduced computational complexity of small ones. In this paper, we present an implementation of knowledge distillation, demonstrating an overall boost in the student models' performance for the task of classifying jets at the LHC. Furthermore, by using a teacher model with a strong inductive bias of Lorentz symmetry, we show that we can induce the same inductive bias in the student model which leads to better robustness against arbitrary Lorentz boost.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β hep-ex
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Parameterized Machine Learning for High-Energy Physics
R.I.P.
π»
Ghosted
A Convolutional Neural Network Neutrino Event Classifier
R.I.P.
π»
Ghosted
Variational Autoencoders for New Physics Mining at the Large Hadron Collider
R.I.P.
π»
Ghosted
Jet Constituents for Deep Neural Network Based Top Quark Tagging
R.I.P.
π»
Ghosted
Long Short-Term Memory (LSTM) networks with jet constituents for boosted top tagging at the LHC
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted