๐ฎ
๐ฎ
The Ethereal
DeInfoReg: A Decoupled Learning Framework for Better Training Throughput
June 22, 2025 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: README.md, jsontocsv.py, nlp, requirements.txt, train_nlp.py, train_vision.py, utils, vision
Authors
Zih-Hao Huang, You-Teng Lin, Hung-Hsuan Chen
arXiv ID
2506.18193
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.DC
Citations
1
Venue
arXiv.org
Repository
https://github.com/ianzih/Decoupled-Supervised-Learning-for-Information-Regularization/
โญ 1
Last Checked
4 months ago
Abstract
This paper introduces Decoupled Supervised Learning with Information Regularization (DeInfoReg), a novel approach that transforms a long gradient flow into multiple shorter ones, thereby mitigating the vanishing gradient problem. Integrating a pipeline strategy, DeInfoReg enables model parallelization across multiple GPUs, significantly improving training throughput. We compare our proposed method with standard backpropagation and other gradient flow decomposition techniques. Extensive experiments on diverse tasks and datasets demonstrate that DeInfoReg achieves superior performance and better noise resistance than traditional BP models and efficiently utilizes parallel computing resources. The code for reproducibility is available at: https://github.com/ianzih/Decoupled-Supervised-Learning-for-Information-Regularization/.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal