| 1 | Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift<br>Sergey Ioffe, Christian Szegedy | 👻 Ghosted | cs.LG | 46.0K | 11 years ago |
| 2 | Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks<br>Chelsea Finn, Pieter Abbeel, Sergey Levine | 🌅 Old Age | cs.LG | 13.8K | 9 years ago |
| 3 | Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning<br>Yarin Gal, Zoubin Ghahramani | 👻 Ghosted | stat.ML | 11.0K | 10 years ago |
| 4 | Show, Attend and Tell: Neural Image Caption Generation with Visual Attention<br>Kelvin Xu, Jimmy Ba, ... (+6 more) | 👻 Ghosted | cs.LG | 10.6K | 11 years ago |
| 5 | Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor<br>Tuomas Haarnoja, Aurick Zhou, ... (+2 more) | 🌅 Old Age | cs.LG | 10.4K | 8 years ago |
| 6 | Asynchronous Methods for Deep Reinforcement Learning<br>Volodymyr Mnih, Adrià Puigdomènech Badia, ... (+6 more) | 👻 Ghosted | cs.LG | 9.7K | 10 years ago |
| 7 | Deep Unsupervised Learning using Nonequilibrium Thermodynamics<br>Jascha Sohl-Dickstein, Eric A. Weiss, ... (+2 more) | 👻 Ghosted | cs.LG | 9.2K | 11 years ago |
| 8 | Neural Message Passing for Quantum Chemistry<br>Justin Gilmer, Samuel S. Schoenholz, ... (+3 more) | 👻 Ghosted | cs.LG | 8.6K | 9 years ago |
| 9 | Training data-efficient image transformers & distillation through attention<br>Hugo Touvron, Matthieu Cord, ... (+4 more) | 👻 Ghosted | cs.CV | 8.5K | 5 years ago |
| 10 | Trust Region Policy Optimization<br>John Schulman, Sergey Levine, ... (+3 more) | 👻 Ghosted | cs.LG | 7.6K | 11 years ago |
| 11 | Axiomatic Attribution for Deep Networks<br>Mukund Sundararajan, Ankur Taly, Qiqi Yan | 👻 Ghosted | cs.LG | 7.3K | 9 years ago |
| 12 | On Calibration of Modern Neural Networks<br>Chuan Guo, Geoff Pleiss, ... (+2 more) | 👻 Ghosted | cs.LG | 7.2K | 8 years ago |
| 13 | Addressing Function Approximation Error in Actor-Critic Methods<br>Scott Fujimoto, Herke van Hoof, David Meger | 👻 Ghosted | cs.AI | 6.4K | 8 years ago |
| 14 | Robust Speech Recognition via Large-Scale Weak Supervision<br>Alec Radford, Jong Wook Kim, ... (+4 more) | 👻 Ghosted | eess.AS | 6.1K | 3 years ago |
| 15 | Learning Transferable Features with Deep Adaptation Networks<br>Mingsheng Long, Yue Cao, ... (+2 more) | 👻 Ghosted | cs.LG | 5.7K | 11 years ago |
| 16 | Variational Inference with Normalizing Flows<br>Danilo Jimenez Rezende, Shakir Mohamed | 👻 Ghosted | stat.ML | 4.7K | 10 years ago |
| 17 | Learning Important Features Through Propagating Activation Differences<br>Avanti Shrikumar, Peyton Greenside, Anshul Kundaje | 👻 Ghosted | cs.CV | 4.4K | 9 years ago |
| 18 | Dueling Network Architectures for Deep Reinforcement Learning<br>Ziyu Wang, Tom Schaul, ... (+4 more) | 👻 Ghosted | cs.LG | 4.2K | 10 years ago |
| 19 | Convolutional Sequence to Sequence Learning<br>Jonas Gehring, Michael Auli, ... (+3 more) | 👻 Ghosted | cs.CL | 3.5K | 8 years ago |
| 20 | Conditional Image Synthesis With Auxiliary Classifier GANs<br>Augustus Odena, Christopher Olah, Jonathon Shlens | 👻 Ghosted | stat.ML | 3.4K | 9 years ago |
| 21 | Complex Embeddings for Simple Link Prediction<br>Théo Trouillon, Johannes Welbl, ... (+3 more) | 👻 Ghosted | cs.AI | 3.4K | 9 years ago |
| 22 | Understanding Black-box Predictions via Influence Functions<br>Pang Wei Koh, Percy Liang | 👻 Ghosted | stat.ML | 3.4K | 9 years ago |
| 23 | Generative Adversarial Text to Image Synthesis<br>Scott Reed, Zeynep Akata, ... (+4 more) | 👻 Ghosted | cs.NE | 3.4K | 9 years ago |
| 24 | Unsupervised Deep Embedding for Clustering Analysis<br>Junyuan Xie, Ross Girshick, Ali Farhadi | 👻 Ghosted | cs.LG | 3.3K | 10 years ago |
| 25 | CyCADA: Cycle-Consistent Adversarial Domain Adaptation<br>Judy Hoffman, Eric Tzeng, ... (+6 more) | 👻 Ghosted | cs.CV | 3.2K | 8 years ago |
| 26 | Deep Speech 2: End-to-End Speech Recognition in English and Mandarin<br>Dario Amodei, Rishita Anubhai, ... (+32 more) | 👻 Ghosted | cs.CL | 3.1K | 10 years ago |
| 27 | Efficient Neural Architecture Search via Parameter Sharing<br>Hieu Pham, Melody Y. Guan, ... (+3 more) | 👻 Ghosted | cs.LG | 2.9K | 8 years ago |
| 28 | Pixel Recurrent Neural Networks<br>Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu | 👻 Ghosted | cs.CV | 2.8K | 10 years ago |
| 29 | Language Modeling with Gated Convolutional Networks<br>Yann N. Dauphin, Angela Fan, ... (+2 more) | 👻 Ghosted | cs.CL | 2.7K | 9 years ago |
| 30 | Unsupervised Learning of Video Representations using LSTMs<br>Nitish Srivastava, Elman Mansimov, Ruslan Salakhutdinov | 👻 Ghosted | cs.LG | 2.7K | 11 years ago |
| 31 | Deep Transfer Learning with Joint Adaptation Networks<br>Mingsheng Long, Han Zhu, ... (+2 more) | 👻 Ghosted | cs.LG | 2.7K | 9 years ago |
| 32 | Revisiting Semi-Supervised Learning with Graph Embeddings<br>Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov | 👻 Ghosted | cs.LG | 2.4K | 10 years ago |
| 33 | PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization<br>Jingqing Zhang, Yao Zhao, ... (+2 more) | 👻 Ghosted | cs.CL | 2.3K | 6 years ago |
| 34 | Representation Learning on Graphs with Jumping Knowledge Networks<br>Keyulu Xu, Chengtao Li, ... (+4 more) | 👻 Ghosted | cs.LG | 2.3K | 7 years ago |
| 35 | Learning Convolutional Neural Networks for Graphs<br>Mathias Niepert, Mohamed Ahmed, Konstantin Kutzkov | 👻 Ghosted | cs.LG | 2.3K | 9 years ago |
| 36 | Group Equivariant Convolutional Networks<br>Taco S. Cohen, Max Welling | 👻 Ghosted | cs.LG | 2.2K | 10 years ago |
| 37 | Autoencoding beyond pixels using a learned similarity metric<br>Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, ... (+2 more) | 👻 Ghosted | cs.LG | 2.2K | 10 years ago |
| 38 | Deep Learning with Limited Numerical Precision<br>Suyog Gupta, Ankur Agrawal, ... (+2 more) | 👻 Ghosted | cs.LG | 2.1K | 11 years ago |
| 39 | A Closer Look at Memorization in Deep Networks<br>Devansh Arpit, Stanisław Jastrzębski, ... (+9 more) | 👻 Ghosted | stat.ML | 2.1K | 8 years ago |
| 40 | Learning to Discover Cross-Domain Relations with Generative Adversarial Networks<br>Taeksoo Kim, Moonsu Cha, ... (+3 more) | 🌅 Old Age | cs.CV | 2.1K | 9 years ago |
| 41 | DRAW: A Recurrent Neural Network For Image Generation<br>Karol Gregor, Ivo Danihelka, ... (+3 more) | 👻 Ghosted | cs.CV | 2.0K | 11 years ago |
| 42 | Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates<br>Dong Yin, Yudong Chen, ... (+2 more) | 👻 Ghosted | cs.LG | 2.0K | 8 years ago |
| 43 | Deep Bayesian Active Learning with Image Data<br>Yarin Gal, Riashat Islam, Zoubin Ghahramani | 👻 Ghosted | cs.LG | 1.9K | 9 years ago |
| 44 | Off-Policy Deep Reinforcement Learning without Exploration<br>Scott Fujimoto, David Meger, Doina Precup | 👻 Ghosted | cs.LG | 1.9K | 7 years ago |
| 45 | Benchmarking Deep Reinforcement Learning for Continuous Control<br>Yan Duan, Xi Chen, ... (+3 more) | 🌅 Old Age | cs.LG | 1.8K | 9 years ago |
| 46 | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures<br>Lasse Espeholt, Hubert Soyer, ... (+10 more) | 👻 Ghosted | cs.LG | 1.8K | 8 years ago |
| 47 | A Distributional Perspective on Reinforcement Learning<br>Marc G. Bellemare, Will Dabney, Rémi Munos | 👻 Ghosted | cs.LG | 1.7K | 8 years ago |
| 48 | Large-Scale Evolution of Image Classifiers<br>Esteban Real, Sherry Moore, ... (+6 more) | 👻 Ghosted | cs.NE | 1.7K | 9 years ago |
| 49 | Learning Latent Dynamics for Planning from Pixels<br>Danijar Hafner, Timothy Lillicrap, ... (+5 more) | 👻 Ghosted | cs.LG | 1.7K | 7 years ago |
| 50 | Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations<br>Francesco Locatello, Stefan Bauer, ... (+5 more) | 👻 Ghosted | cs.LG | 1.7K | 7 years ago |