Improving reproducibility by controlling random seed stability in machine learning based estimation via bagging

April 20, 2026 · Grace Period · + Add venue

Authors Nicholas Williams, Alejandro Schuler arXiv ID 2604.17694 Category stat.ME Cross-listed cs.LG, stat.ML Citations 0

Abstract

Predictions from machine learning algorithms can vary across random seeds, inducing instability in downstream debiased machine learning estimators. We formalize random seed stability via a concentration condition and prove that subbagging guarantees stability for any bounded-outcome regression algorithm. We introduce a new cross-fitting procedure, adaptive cross-bagging, which simultaneously eliminates seed dependence from both nuisance estimation and sample splitting in debiased machine learning. Numerical experiments confirm that the method achieves the targeted level of stability whereas alternatives do not. Our method incurs a small computational penalty relative to standard practice whereas alternative methods incur large penalties.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — stat.ME

R.I.P. 👻 Ghosted

Causal inference using invariant prediction: identification and confidence intervals

Jonas Peters, Peter Bühlmann, Nicolai Meinshausen

stat.ME 🏛 J.RSSSB 📚 1.1K cites 11 years ago

R.I.P. 👻 Ghosted

Performance Metrics (Error Measures) in Machine Learning Regression, Forecasting and Prognostics: Properties and Typology

Alexei Botchkarev

stat.ME 🏛 Interdisciplinary Journal of Information, Knowledge, and Management 📚 671 cites 7 years ago

R.I.P. 👻 Ghosted

External Validity: From Do-Calculus to Transportability Across Populations

Judea Pearl, Elias Bareinboim

stat.ME 🏛 Probabilistic and Causal Inference 📚 366 cites 11 years ago

R.I.P. 👻 Ghosted

Least Ambiguous Set-Valued Classifiers with Bounded Error Levels

Mauricio Sadinle, Jing Lei, Larry Wasserman

stat.ME 🏛 J.ASA 📚 318 cites 9 years ago

R.I.P. 👻 Ghosted

Doubly Robust Policy Evaluation and Optimization

Miroslav Dudík, Dumitru Erhan, ... (+2 more)

stat.ME 🏛 arXiv 📚 308 cites 11 years ago

R.I.P. 👻 Ghosted

Comparison of Bayesian predictive methods for model selection

Juho Piironen, Aki Vehtari

stat.ME 🏛 Statistics and computing 📚 304 cites 11 years ago