DeInfoReg: A Decoupled Learning Framework for Better Training Throughput

June 22, 2025 ยท Entered Twilight ยท ๐Ÿ› arXiv.org

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: README.md, jsontocsv.py, nlp, requirements.txt, train_nlp.py, train_vision.py, utils, vision

Authors Zih-Hao Huang, You-Teng Lin, Hung-Hsuan Chen arXiv ID 2506.18193 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.DC Citations 1 Venue arXiv.org Repository https://github.com/ianzih/Decoupled-Supervised-Learning-for-Information-Regularization/ โญ 1 Last Checked 4 months ago
Abstract
This paper introduces Decoupled Supervised Learning with Information Regularization (DeInfoReg), a novel approach that transforms a long gradient flow into multiple shorter ones, thereby mitigating the vanishing gradient problem. Integrating a pipeline strategy, DeInfoReg enables model parallelization across multiple GPUs, significantly improving training throughput. We compare our proposed method with standard backpropagation and other gradient flow decomposition techniques. Extensive experiments on diverse tasks and datasets demonstrate that DeInfoReg achieves superior performance and better noise resistance than traditional BP models and efficiently utilizes parallel computing resources. The code for reproducibility is available at: https://github.com/ianzih/Decoupled-Supervised-Learning-for-Information-Regularization/.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning