TanhSoft -- a family of activation functions combining Tanh and Softplus

September 08, 2020 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Koushik Biswas, Sandeep Kumar, Shilpak Banerjee, Ashish Kumar Pandey
arXiv ID: 2009.03863
Category: cs.NE (Neural & Evolutionary)
Cross-listed: cs.AI, cs.CV, cs.LG
Citations: 6
Venue: arXiv.org
Last Checked: 4 months ago

Abstract

Deep learning, at its core, contains functions that are compositions of a linear transformation with a non-linear function known as an activation function. In the past few years, there has been increasing interest in the construction of novel activation functions resulting in better learning. In this work, we propose a family of novel activation functions, namely TanhSoft, with four undetermined hyper-parameters, of the form tanh(αx + βe^{γx}) ln(δ + e^x), and tune these hyper-parameters to obtain activation functions which are shown to outperform several well-known activation functions. For instance, replacing ReLU with x tanh(0.6e^x) improves top-1 classification accuracy on CIFAR-10 by 0.46% for DenseNet-169 and 0.7% for Inception-v3, while with tanh(0.87x) ln(1 + e^x) top-1 classification accuracy on CIFAR-100 improves by 1.24% for DenseNet-169 and 2.57% for the SimpleNet model.
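The family described in the abstract can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' code (none is linked from this page): the function name `tanhsoft`, the clipping bound, and the assumption δ ≥ 0 are choices made here. Both activations named in the abstract fall out as special cases: α = 0, β = 0.6, γ = 1, δ = 0 gives x tanh(0.6e^x), because ln(e^x) = x, and α = 0.87, β = 0, δ = 1 gives tanh(0.87x) ln(1 + e^x).

```python
import numpy as np


def tanhsoft(x, alpha, beta, gamma, delta):
    """Evaluate tanh(alpha*x + beta*exp(gamma*x)) * ln(delta + exp(x)).

    Hypothetical helper written for illustration; assumes delta >= 0.
    """
    x = np.asarray(x, dtype=np.float64)

    # Clip the exponent so exp() cannot overflow in float64; tanh has
    # already saturated to +/-1 long before the clipping bound matters.
    inner = alpha * x + beta * np.exp(np.clip(gamma * x, -700.0, 700.0))

    if delta == 0:
        # ln(0 + e^x) simplifies to x exactly.
        log_term = x
    else:
        # ln(delta + e^x) = logaddexp(ln(delta), x), stable for large |x|.
        log_term = np.logaddexp(np.log(delta), x)

    return np.tanh(inner) * log_term


# The two special cases quoted in the abstract.
xs = np.linspace(-3.0, 3.0, 7)
variant_cifar10 = tanhsoft(xs, alpha=0.0, beta=0.6, gamma=1.0, delta=0.0)
variant_cifar100 = tanhsoft(xs, alpha=0.87, beta=0.0, gamma=0.0, delta=1.0)
print(variant_cifar10)
print(variant_cifar100)
```

With β = 0 the value of γ has no effect, so it is set to 0 above purely as a placeholder. In a deep learning framework the same expression would be written with that framework's tensor operations so that gradients flow through it.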

📜 Similar Papers

In the same crypt — Neural & Evolutionary

🔮 The Ethereal

LSTM: A Search Space Odyssey

Klaus Greff, Rupesh Kumar Srivastava, ... (+3 more)

cs.NE 🏛 IEEE TNNLS 📚 6.0K cites 11 years ago

R.I.P. 👻 Ghosted

Deep Learning using Rectified Linear Units (ReLU)

Abien Fred Agarap

cs.NE 🏛 arXiv 📚 3.8K cites 8 years ago

R.I.P. 👻 Ghosted

Generative Adversarial Text to Image Synthesis

Scott Reed, Zeynep Akata, ... (+4 more)

cs.NE 🏛 ICML 📚 3.4K cites 10 years ago

R.I.P. 👻 Ghosted

Regularized Evolution for Image Classifier Architecture Search

Esteban Real, Alok Aggarwal, ... (+2 more)

cs.NE 🏛 AAAI 📚 3.2K cites 8 years ago

R.I.P. 👻 Ghosted

Temporal Ensembling for Semi-Supervised Learning

Samuli Laine, Timo Aila

cs.NE 🏛 ICLR 📚 2.8K cites 9 years ago

🌅 Old Age

Learning Structured Sparsity in Deep Neural Networks

Wei Wen, Chunpeng Wu, ... (+3 more)

cs.NE 🏛 NeurIPS 📚 2.5K cites 9 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago