Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate

October 23, 2023 · Declared Dead · 🏛 IEEE Transactions on Neural Networks and Learning Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Pengfei Sun, Jibin Wu, Malu Zhang, Paul Devos, Dick Botteldooren
arXiv ID: 2310.14982
Category: cs.NE (Neural & Evolutionary)
Cross-listed: cs.LG, eess.AS, eess.SP
Citations: 19
Venue: IEEE Transactions on Neural Networks and Learning Systems
Last Checked: 4 months ago

Abstract

Recurrent Neural Networks (RNNs) are widely recognized for their proficiency in modeling temporal dependencies, making them highly prevalent in sequential data processing applications. Nevertheless, vanilla RNNs suffer from the well-known problem of vanishing and exploding gradients, which makes long-range dependencies difficult to learn. Additionally, gated RNNs tend to be over-parameterized, resulting in poor computational efficiency and poor network generalization. To address these challenges, this paper proposes a novel Delayed Memory Unit (DMU). The DMU incorporates a delay line structure along with delay gates into a vanilla RNN, thereby enhancing temporal interaction and facilitating temporal credit assignment. Specifically, the DMU is designed to directly distribute the input information to the optimal time instant in the future, rather than aggregating and redistributing it over time through intricate network dynamics. Our proposed DMU demonstrates superior temporal modeling capabilities across a broad range of sequential modeling tasks, utilizing considerably fewer parameters than other state-of-the-art gated RNN models in applications such as speech recognition, radar gesture recognition, ECG waveform segmentation, and permuted sequential image classification.
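
The delay-line idea described in the abstract can be illustrated with a small sketch. The abstract gives no equations, so everything below is an assumption made for illustration and should not be read as the authors' formulation: the class name `DelayGateCell`, the weight names `W`, `U`, `Wg`, `Ug`, the softmax form of the delay gate, and the random initialisation are all hypothetical. The sketch only shows the general mechanism of writing a gated share of the current candidate into future time slots and reading out whatever has arrived at the current step.

```python
import math
import random


def softmax(zs):
    """Numerically stable softmax over a list of floats."""
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.5):
    """Small uniform random matrix; the initialisation is an arbitrary choice."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class DelayGateCell:
    """Hypothetical delay-gated recurrent cell (illustrative, not the paper's code)."""

    def __init__(self, n_in, n_hidden, max_delay, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        self.n_slots = max_delay + 1  # one slot per delay 0..max_delay
        self.W = rand_matrix(n_hidden, n_in, rng)            # input -> candidate
        self.U = rand_matrix(n_hidden, n_hidden, rng)        # hidden -> candidate
        self.Wg = rand_matrix(self.n_slots, n_in, rng)       # input -> delay gate
        self.Ug = rand_matrix(self.n_slots, n_hidden, rng)   # hidden -> delay gate

    def run(self, xs):
        h = [0.0] * self.n_hidden
        # line[d] accumulates content scheduled to arrive d steps from now.
        line = [[0.0] * self.n_hidden for _ in range(self.n_slots)]
        outputs = []
        for x in xs:
            # Candidate memory, computed as in a vanilla RNN.
            cand = [math.tanh(a + b)
                    for a, b in zip(matvec(self.W, x), matvec(self.U, h))]
            # Assumed delay gate: a softmax distribution over the possible delays.
            gate = softmax([a + b
                            for a, b in zip(matvec(self.Wg, x), matvec(self.Ug, h))])
            # Send a gated share of the candidate to each future slot.
            for d in range(self.n_slots):
                for j in range(self.n_hidden):
                    line[d][j] += gate[d] * cand[j]
            # Content with zero remaining delay becomes the new hidden state;
            # every other slot moves one step closer to the present.
            h = line.pop(0)
            line.append([0.0] * self.n_hidden)
            outputs.append(h)
        return outputs


if __name__ == "__main__":
    cell = DelayGateCell(n_in=2, n_hidden=3, max_delay=4)
    sequence = [[0.1 * t, -0.2] for t in range(6)]
    for t, h in enumerate(cell.run(sequence)):
        print(t, [round(v, 3) for v in h])
```

In this sketch the gate adds only `max_delay + 1` output rows on top of the vanilla RNN weights, which is one way to read the abstract's claim of a smaller parameter count than fully gated cells; the paper's actual parameterisation may differ.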