Decoupled Mixup for Generalized Visual Recognition

October 26, 2022 ยท Entered Twilight ยท ๐Ÿ› arXiv.org

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: README.md, WSSS, data, data_find.py, dataset_json, environment.yaml, evaluation.py, find_data.sh, find_data_with_mask.sh, main.py, moco, pipeline.png, pydensecrf-master.zip, results, run_densenet121_track1.sh, run_densenet121_track2.sh, run_ensemble_track1.sh, run_ensemble_track2.sh, run_moco_track1.sh, run_moco_track2.sh, run_resnet34_track1.sh, run_resnet34_track2.sh, run_train_track1.sh, run_train_track2.sh, run_wide_resnet50_2_track1.sh, run_wide_resnet50_2_track2.sh, run_wsss_track1.sh, run_wsss_track2.sh, utils

Authors Haozhe Liu, Wentian Zhang, Jinheng Xie, Haoqian Wu, Bing Li, Ziqi Zhang, Yuexiang Li, Yawen Huang, Bernard Ghanem, Yefeng Zheng arXiv ID 2210.14783 Category cs.CV: Computer Vision Citations 1 Venue arXiv.org Repository https://github.com/HaozheLiu-ST/NICOChallenge-OOD-Classification โญ 9 Last Checked 3 months ago
Abstract
Convolutional neural networks (CNN) have demonstrated remarkable performance when the training and testing data are from the same distribution. However, such trained CNN models often largely degrade on testing data which is unseen and Out-Of-the-Distribution (OOD). To address this issue, we propose a novel "Decoupled-Mixup" method to train CNN models for OOD visual recognition. Different from previous work combining pairs of images homogeneously, our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions of image pairs to train CNN models. Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair. To further improve the generalization ability of trained models, we propose to disentangle discriminative and noise-prone regions in frequency-based and context-based fashions. Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts, where our method achieves 85.76\% top-1 accuracy in Track-1 and 79.92\% in Track-2 in the NICO Challenge. The source code is available at https://github.com/HaozheLiu-ST/NICOChallenge-OOD-Classification.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision

๐ŸŒ… ๐ŸŒ… Old Age

Fast R-CNN

Ross Girshick

cs.CV ๐Ÿ› ICCV ๐Ÿ“š 27.7K cites 11 years ago