| 51 | GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks<br>Zhao Chen, Vijay Badrinarayanan, ... (+2 more) | 👻 Ghosted | cs.CV | 1.7K | 8 years ago |
| 52 | Constrained Policy Optimization<br>Joshua Achiam, David Held, ... (+2 more) | 👻 Ghosted | cs.LG | 1.6K | 8 years ago |
| 53 | Which Training Methods for GANs do actually Converge?<br>Lars Mescheder, Andreas Geiger, Sebastian Nowozin | 👻 Ghosted | cs.LG | 1.6K | 8 years ago |
| 54 | MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels<br>Lu Jiang, Zhengyuan Zhou, ... (+3 more) | 🌅 Old Age | cs.CV | 1.6K | 8 years ago |
| 55 | A Convergence Theory for Deep Learning via Over-Parameterization<br>Zeyuan Allen-Zhu, Yuanzhi Li, Zhao Song | 👻 Ghosted | cs.LG | 1.6K | 7 years ago |
| 56 | Junction Tree Variational Autoencoder for Molecular Graph Generation<br>Wengong Jin, Regina Barzilay, Tommi Jaakkola | 👻 Ghosted | cs.LG | 1.6K | 8 years ago |
| 57 | Large-Margin Softmax Loss for Convolutional Neural Networks<br>Weiyang Liu, Yandong Wen, ... (+2 more) | 👻 Ghosted | stat.ML | 1.6K | 9 years ago |
| 58 | Reinforcement Learning with Deep Energy-Based Policies<br>Tuomas Haarnoja, Haoran Tang, ... (+2 more) | 👻 Ghosted | cs.LG | 1.5K | 9 years ago |
| 59 | Train faster, generalize better: Stability of stochastic gradient descent<br>Moritz Hardt, Benjamin Recht, Yoram Singer | 👻 Ghosted | cs.LG | 1.4K | 10 years ago |
| 60 | Black-box Adversarial Attacks with Limited Queries and Information<br>Andrew Ilyas, Logan Engstrom, ... (+2 more) | 👻 Ghosted | cs.CV | 1.3K | 8 years ago |
| 61 | Fast Inference from Transformers via Speculative Decoding<br>Yaniv Leviathan, Matan Kalman, Yossi Matias | 👻 Ghosted | cs.LG | 1.2K | 3 years ago |
| 62 | Compressing Neural Networks with the Hashing Trick<br>Wenlin Chen, James T. Wilson, ... (+3 more) | 👻 Ghosted | cs.LG | 1.2K | 11 years ago |
| 63 | Adafactor: Adaptive Learning Rates with Sublinear Memory Cost<br>Noam Shazeer, Mitchell Stern | 👻 Ghosted | cs.LG | 1.2K | 8 years ago |
| 64 | Overcoming catastrophic forgetting with hard attention to the task<br>Joan Serrà, Dídac Surís, ... (+2 more) | 👻 Ghosted | cs.LG | 1.2K | 8 years ago |
| 65 | signSGD: Compressed Optimisation for Non-Convex Problems<br>Jeremy Bernstein, Yu-Xiang Wang, ... (+2 more) | 🌅 Old Age | cs.LG | 1.2K | 8 years ago |
| 66 | Optimizing Neural Networks with Kronecker-factored Approximate Curvature<br>James Martens, Roger Grosse | 👻 Ghosted | cs.LG | 1.2K | 11 years ago |
| 67 | Ask Me Anything: Dynamic Memory Networks for Natural Language Processing<br>Ankit Kumar, Ozan Irsoy, ... (+7 more) | 👻 Ghosted | cs.CL | 1.2K | 10 years ago |
| 68 | Gradient Descent Finds Global Minima of Deep Neural Networks<br>Simon S. Du, Jason D. Lee, ... (+3 more) | 👻 Ghosted | cs.LG | 1.2K | 7 years ago |
| 69 | Ditto: Fair and Robust Federated Learning Through Personalization<br>Tian Li, Shengyuan Hu, ... (+2 more) | 👻 Ghosted | cs.LG | 1.2K | 5 years ago |
| 70 | Born Again Neural Networks<br>Tommaso Furlanello, Zachary C. Lipton, ... (+3 more) | 👻 Ghosted | stat.ML | 1.2K | 7 years ago |
| 71 | Meta Networks<br>Tsendsuren Munkhdalai, Hong Yu | 👻 Ghosted | cs.LG | 1.1K | 9 years ago |
| 72 | OptNet: Differentiable Optimization as a Layer in Neural Networks<br>Brandon Amos, J. Zico Kolter | 👻 Ghosted | cs.LG | 1.1K | 9 years ago |
| 73 | Out-of-Distribution Generalization via Risk Extrapolation (REx)<br>David Krueger, Ethan Caballero, ... (+6 more) | 👻 Ghosted | cs.LG | 1.1K | 6 years ago |
| 74 | MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing<br>Sami Abu-El-Haija, Bryan Perozzi, ... (+6 more) | 👻 Ghosted | cs.LG | 1.1K | 6 years ago |
| 75 | Continuous Deep Q-Learning with Model-based Acceleration<br>Shixiang Gu, Timothy Lillicrap, ... (+2 more) | 👻 Ghosted | cs.LG | 1.1K | 10 years ago |
| 76 | Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks<br>Sanjeev Arora, Simon S. Du, ... (+3 more) | 👻 Ghosted | cs.LG | 1.0K | 7 years ago |
| 77 | Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization<br>Chelsea Finn, Sergey Levine, Pieter Abbeel | 👻 Ghosted | cs.LG | 1.0K | 10 years ago |
| 78 | Toward Controlled Generation of Text<br>Zhiting Hu, Zichao Yang, ... (+3 more) | 🌅 Old Age | cs.LG | 1.0K | 9 years ago |
| 79 | Gradient-based Hyperparameter Optimization through Reversible Learning<br>Dougal Maclaurin, David Duvenaud, Ryan P. Adams | 👻 Ghosted | stat.ML | 1.0K | 11 years ago |
| 80 | MASS: Masked Sequence to Sequence Pre-training for Language Generation<br>Kaitao Song, Xu Tan, ... (+3 more) | 👻 Ghosted | cs.CL | 1.0K | 6 years ago |
| 81 | On Deep Multi-View Representation Learning: Objectives and Optimization<br>Weiran Wang, Raman Arora, ... (+2 more) | 👻 Ghosted | cs.LG | 1.0K | 10 years ago |
| 82 | FeUdal Networks for Hierarchical Reinforcement Learning<br>Alexander Sasha Vezhnevets, Simon Osindero, ... (+5 more) | 👻 Ghosted | cs.AI | 1.0K | 9 years ago |
| 83 | Data Shapley: Equitable Valuation of Data for Machine Learning<br>Amirata Ghorbani, James Zou | 👻 Ghosted | stat.ML | 995 | 7 years ago |
| 84 | Texture Networks: Feed-forward Synthesis of Textures and Stylized Images<br>Dmitry Ulyanov, Vadim Lebedev, ... (+2 more) | 👻 Ghosted | cs.CV | 994 | 10 years ago |
| 85 | Robust Adversarial Reinforcement Learning<br>Lerrel Pinto, James Davidson, ... (+2 more) | 👻 Ghosted | cs.LG | 985 | 9 years ago |
| 86 | Implicit Geometric Regularization for Learning Shapes<br>Amos Gropp, Lior Yariv, ... (+3 more) | 👻 Ghosted | cs.LG | 978 | 6 years ago |
| 87 | QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning<br>Kyunghwan Son, Daewoo Kim, ... (+3 more) | 👻 Ghosted | cs.LG | 974 | 6 years ago |
| 88 | Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering<br>Bo Yang, Xiao Fu, ... (+2 more) | 👻 Ghosted | cs.LG | 963 | 9 years ago |
| 89 | GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models<br>Jiaxuan You, Rex Ying, ... (+3 more) | 👻 Ghosted | cs.LG | 961 | 8 years ago |
| 90 | The Hidden Vulnerability of Distributed Learning in Byzantium<br>El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault | 👻 Ghosted | stat.ML | 940 | 8 years ago |
| 91 | MADE: Masked Autoencoder for Distribution Estimation<br>Mathieu Germain, Karol Gregor, ... (+2 more) | 👻 Ghosted | cs.LG | 925 | 11 years ago |
| 92 | How to Escape Saddle Points Efficiently<br>Chi Jin, Rong Ge, ... (+3 more) | 👻 Ghosted | cs.LG | 907 | 9 years ago |
| 93 | Generative Moment Matching Networks<br>Yujia Li, Kevin Swersky, Richard Zemel | 👻 Ghosted | cs.LG | 899 | 11 years ago |
| 94 | Actor-Attention-Critic for Multi-Agent Reinforcement Learning<br>Shariq Iqbal, Fei Sha | 👻 Ghosted | cs.LG | 896 | 7 years ago |
| 95 | Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis<br>Yuxuan Wang, Daisy Stanton, ... (+8 more) | 👻 Ghosted | cs.CL | 893 | 8 years ago |
| 96 | Parallel WaveNet: Fast High-Fidelity Speech Synthesis<br>Aaron van den Oord, Yazhe Li, ... (+20 more) | 👻 Ghosted | cs.LG | 893 | 8 years ago |
| 97 | Compressed Sensing using Generative Models<br>Ashish Bora, Ajil Jalal, ... (+2 more) | 👻 Ghosted | stat.ML | 893 | 9 years ago |
| 98 | Gated Feedback Recurrent Neural Networks<br>Junyoung Chung, Caglar Gulcehre, ... (+2 more) | 👻 Ghosted | cs.NE | 884 | 11 years ago |
| 99 | Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness<br>Michael Kearns, Seth Neel, ... (+2 more) | 👻 Ghosted | cs.LG | 866 | 8 years ago |
| 100 | A Theoretical Analysis of Contrastive Unsupervised Representation Learning<br>Sanjeev Arora, Hrishikesh Khandeparkar, ... (+3 more) | 👻 Ghosted | cs.LG | 859 | 7 years ago |