๐
๐
Old Age
Benchmark for Uncertainty & Robustness in Self-Supervised Learning
December 23, 2022 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: .gitattributes, .gitignore, LICENSE, README.md, algorithms, datasets, deploy.sh, gallery, main.py, requirements.txt, setup.sh, train.sh, utils
Authors
Ha Manh Bui, Iliana Maifeld-Carucci
arXiv ID
2212.12411
Category
cs.CV: Computer Vision
Cross-listed
cs.LG
Citations
1
Venue
arXiv.org
Repository
https://github.com/hamanhbui/reliable_ssl_baselines
โญ 3
Last Checked
3 months ago
Abstract
Self-Supervised Learning (SSL) is crucial for real-world applications, especially in data-hungry domains such as healthcare and self-driving cars. In addition to a lack of labeled data, these applications also suffer from distributional shifts. Therefore, an SSL method should provide robust generalization and uncertainty estimation in the test dataset to be considered a reliable model in such high-stakes domains. However, existing approaches often focus on generalization, without evaluating the model's uncertainty. The ability to compare SSL techniques for improving these estimates is therefore critical for research on the reliability of self-supervision models. In this paper, we explore variants of SSL methods, including Jigsaw Puzzles, Context, Rotation, Geometric Transformations Prediction for vision, as well as BERT and GPT for language tasks. We train SSL in auxiliary learning for vision and pre-training for language model, then evaluate the generalization (in-out classification accuracy) and uncertainty (expected calibration error) across different distribution covariate shift datasets, including MNIST-C, CIFAR-10-C, CIFAR-10.1, and MNLI. Our goal is to create a benchmark with outputs from experiments, providing a starting point for new SSL methods in Reliable Machine Learning. All source code to reproduce results is available at https://github.com/hamanhbui/reliable_ssl_baselines.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computer Vision
๐
๐
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
๐
๐
Old Age
SSD: Single Shot MultiBox Detector
๐
๐
Old Age
Squeeze-and-Excitation Networks
๐
๐
Old Age
Fast R-CNN
๐
๐
Old Age