| 1 | Attention Is All You Need<br>Ashish Vaswani, Noam Shazeer, ... (+6 more) | 🌅 Old Age | cs.CL | 166.0K | 8 years ago |
| 2 | Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting<br>Xingjian Shi, Zhourong Chen, ... (+4 more) | 🏛️ Transcended | cs.CV | 9.2K | 10 years ago |
| 3 | XLNet: Generalized Autoregressive Pretraining for Language Understanding<br>Zhilin Yang, Zihang Dai, ... (+4 more) | 🌅 Old Age | cs.CL | 9.2K | 6 years ago |
| 4 | Matching Networks for One Shot Learning<br>Oriol Vinyals, Charles Blundell, ... (+3 more) | 🏛️ Transcended | cs.LG | 8.1K | 9 years ago |
| 5 | Spatial Transformer Networks<br>Max Jaderberg, Karen Simonyan, ... (+2 more) | 🏛️ Transcended | cs.CV | 7.9K | 11 years ago |
| 6 | Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles<br>Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell | 🏛️ Transcended | stat.ML | 7.0K | 9 years ago |
| 7 | R-FCN: Object Detection via Region-based Fully Convolutional Networks<br>Jifeng Dai, Yi Li, ... (+2 more) | 🌅 Old Age | cs.CV | 6.0K | 10 years ago |
| 8 | What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?<br>Alex Kendall, Yarin Gal | 🏛️ Transcended | cs.CV | 5.5K | 9 years ago |
| 9 | Equality of Opportunity in Supervised Learning<br>Moritz Hardt, Eric Price, Nathan Srebro | 👻 Ghosted | cs.LG | 4.9K | 9 years ago |
| 10 | Convolutional Networks on Graphs for Learning Molecular Fingerprints<br>David Duvenaud, Dougal Maclaurin, ... (+5 more) | 👻 Ghosted | cs.LG | 3.6K | 10 years ago |
| 11 | Generative Adversarial Imitation Learning<br>Jonathan Ho, Stefano Ermon | 👻 Ghosted | cs.LG | 3.5K | 9 years ago |
| 12 | Glow: Generative Flow with Invertible 1x1 Convolutions<br>Diederik P. Kingma, Prafulla Dhariwal | 🌅 Old Age | stat.ML | 3.5K | 7 years ago |
| 13 | Pointer Networks<br>Oriol Vinyals, Meire Fortunato, Navdeep Jaitly | 👻 Ghosted | stat.ML | 3.3K | 10 years ago |
| 14 | Attention-Based Models for Speech Recognition<br>Jan Chorowski, Dzmitry Bahdanau, ... (+3 more) | 👻 Ghosted | cs.CL | 2.7K | 10 years ago |
| 15 | Deep Leakage from Gradients<br>Ligeng Zhu, Zhijian Liu, Song Han | 👻 Ghosted | cs.LG | 2.7K | 6 years ago |
| 16 | Unsupervised Data Augmentation for Consistency Training<br>Qizhe Xie, Zihang Dai, ... (+3 more) | 🌅 Old Age | cs.LG | 2.6K | 7 years ago |
| 17 | Skip-Thought Vectors<br>Ryan Kiros, Yukun Zhu, ... (+5 more) | 👻 Ghosted | cs.CL | 2.5K | 10 years ago |
| 18 | Learning Structured Sparsity in Deep Neural Networks<br>Wei Wen, Chunpeng Wu, ... (+3 more) | 🌅 Old Age | cs.NE | 2.5K | 9 years ago |
| 19 | A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks<br>Kimin Lee, Kibok Lee, ... (+2 more) | 👻 Ghosted | stat.ML | 2.4K | 7 years ago |
| 20 | Hierarchical Graph Representation Learning with Differentiable Pooling<br>Rex Ying, Jiaxuan You, ... (+4 more) | 👻 Ghosted | cs.LG | 2.4K | 7 years ago |
| 21 | Continual Learning with Deep Generative Replay<br>Hanul Shin, Jung Kwon Lee, ... (+2 more) | 👻 Ghosted | cs.AI | 2.4K | 9 years ago |
| 22 | Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks<br>Emily Denton, Soumith Chintala, ... (+2 more) | 👻 Ghosted | cs.CV | 2.3K | 10 years ago |
| 23 | Sanity Checks for Saliency Maps<br>Julius Adebayo, Justin Gilmer, ... (+4 more) | 👻 Ghosted | cs.CV | 2.3K | 7 years ago |
| 24 | Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks<br>Samy Bengio, Oriol Vinyals, ... (+2 more) | 👻 Ghosted | cs.LG | 2.2K | 10 years ago |
| 25 | Visualizing the Loss Landscape of Neural Nets<br>Hao Li, Zheng Xu, ... (+3 more) | 🌅 Old Age | cs.LG | 2.2K | 8 years ago |
| 26 | Learning to learn by gradient descent by gradient descent<br>Marcin Andrychowicz, Misha Denil, ... (+6 more) | 👻 Ghosted | cs.NE | 2.2K | 9 years ago |
| 27 | Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling<br>Jiajun Wu, Chengkai Zhang, ... (+3 more) | 👻 Ghosted | cs.CV | 2.1K | 9 years ago |
| 28 | Understanding the Effective Receptive Field in Deep Convolutional Neural Networks<br>Wenjie Luo, Yujia Li, ... (+2 more) | 👻 Ghosted | cs.CV | 2.1K | 9 years ago |
| 29 | Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks<br>Tim Salimans, Diederik P. Kingma | 👻 Ghosted | cs.LG | 2.1K | 10 years ago |
| 30 | Federated Multi-Task Learning<br>Virginia Smith, Chao-Kai Chiang, ... (+2 more) | 👻 Ghosted | cs.LG | 2.0K | 9 years ago |
| 31 | Adversarial Examples Are Not Bugs, They Are Features<br>Andrew Ilyas, Shibani Santurkar, ... (+4 more) | 👻 Ghosted | stat.ML | 2.0K | 7 years ago |
| 32 | Improving Variational Inference with Inverse Autoregressive Flow<br>Diederik P. Kingma, Tim Salimans, ... (+4 more) | 👻 Ghosted | cs.LG | 2.0K | 9 years ago |
| 33 | Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization<br>Jianyu Wang, Qinghua Liu, ... (+3 more) | 👻 Ghosted | cs.LG | 1.8K | 5 years ago |
| 34 | Learning to Communicate with Deep Multi-Agent Reinforcement Learning<br>Jakob N. Foerster, Yannis M. Assael, ... (+2 more) | 👻 Ghosted | cs.AI | 1.8K | 10 years ago |
| 35 | Counterfactual Fairness<br>Matt J. Kusner, Joshua R. Loftus, ... (+2 more) | 👻 Ghosted | stat.ML | 1.8K | 9 years ago |
| 36 | f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization<br>Sebastian Nowozin, Botond Cseke, Ryota Tomioka | 👻 Ghosted | stat.ML | 1.8K | 10 years ago |
| 37 | Energy-based Out-of-distribution Detection<br>Weitang Liu, Xiaoyun Wang, ... (+2 more) | 👻 Ghosted | cs.LG | 1.7K | 5 years ago |
| 38 | Training Very Deep Networks<br>Rupesh Kumar Srivastava, Klaus Greff, Jürgen Schmidhuber | 👻 Ghosted | cs.LG | 1.7K | 10 years ago |
| 39 | Hierarchical Question-Image Co-Attention for Visual Question Answering<br>Jiasen Lu, Jianwei Yang, ... (+2 more) | 👻 Ghosted | cs.CV | 1.7K | 10 years ago |
| 40 | Coupled Generative Adversarial Networks<br>Ming-Yu Liu, Oncel Tuzel | 👻 Ghosted | cs.CV | 1.7K | 9 years ago |
| 41 | How Does Batch Normalization Help Optimization?<br>Shibani Santurkar, Dimitris Tsipras, ... (+2 more) | 👻 Ghosted | stat.ML | 1.7K | 8 years ago |
| 42 | A simple neural network module for relational reasoning<br>Adam Santoro, David Raposo, ... (+5 more) | 👻 Ghosted | cs.CL | 1.7K | 9 years ago |
| 43 | Learning Combinatorial Optimization Algorithms over Graphs<br>Hanjun Dai, Elias B. Khalil, ... (+3 more) | 👻 Ghosted | cs.LG | 1.6K | 9 years ago |
| 44 | Variational Dropout and the Local Reparameterization Trick<br>Diederik P. Kingma, Tim Salimans, Max Welling | 👻 Ghosted | stat.ML | 1.6K | 10 years ago |
| 45 | Unifying Count-Based Exploration and Intrinsic Motivation<br>Marc G. Bellemare, Sriram Srinivasan, ... (+4 more) | 👻 Ghosted | cs.AI | 1.6K | 9 years ago |
| 46 | Gradient Surgery for Multi-Task Learning<br>Tianhe Yu, Saurabh Kumar, ... (+4 more) | 🌅 Old Age | cs.LG | 1.6K | 6 years ago |
| 47 | Unsupervised Domain Adaptation with Residual Transfer Networks<br>Mingsheng Long, Han Zhu, ... (+2 more) | 👻 Ghosted | cs.LG | 1.6K | 10 years ago |
| 48 | Multi-Task Learning as Multi-Objective Optimization<br>Ozan Sener, Vladlen Koltun | 👻 Ghosted | cs.LG | 1.6K | 7 years ago |
| 49 | Generating Videos with Scene Dynamics<br>Carl Vondrick, Hamed Pirsiavash, Antonio Torralba | 👻 Ghosted | cs.CV | 1.6K | 9 years ago |
| 50 | Domain Separation Networks<br>Konstantinos Bousmalis, George Trigeorgis, ... (+3 more) | 👻 Ghosted | cs.CV | 1.6K | 9 years ago |