SURFing to the Fundamental Limit of Jet Tagging
November 19, 2025 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ian Pang, Darius A. Faroughy, David Shih, Ranit Das, Gregor Kasieczka
arXiv ID
2511.15779
Category
hep-ph
Cross-listed
cs.LG,
hep-ex,
physics.data-an
Citations
2
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Beyond the practical goal of improving search and measurement sensitivity through better jet tagging algorithms, there is a deeper question: what are their upper performance limits? Generative surrogate models with learned likelihood functions offer a new approach to this problem, provided the surrogate correctly captures the underlying data distribution. In this work, we introduce the SUrrogate ReFerence (SURF) method, a new approach to validating generative models. This framework enables exact Neyman-Pearson tests by training the target model on samples from another tractable surrogate, which is itself trained on real data. We argue that the EPiC-FM generative model is a valid surrogate reference for JetClass jets and apply SURF to show that modern jet taggers may already be operating close to the true statistical limit. By contrast, we find that autoregressive GPT models unphysically exaggerate top vs. QCD separation power encoded in the surrogate reference, implying that they are giving a misleading picture of the fundamental limit.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β hep-ph
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds
R.I.P.
π»
Ghosted
An unfolding method based on conditional Invertible Neural Networks (cINN) using iterative training
R.I.P.
π»
Ghosted
PELICAN: Permutation Equivariant and Lorentz Invariant or Covariant Aggregator Network for Particle Physics
R.I.P.
π»
Ghosted
Stacking machine learning classifiers to identify Higgs bosons at the LHC
R.I.P.
π»
Ghosted
The Power of Genetic Algorithms: what remains of the pMSSM?
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted