| 251 | Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs<br>Alon Brutzkus, Amir Globerson | 👻 Ghosted | cs.LG | 316 | 9 years ago |
| 252 | Control of Memory, Active Perception, and Action in Minecraft<br>Junhyuk Oh, Valliappa Chockalingam, ... (+2 more) | 👻 Ghosted | cs.AI | 316 | 9 years ago |
| 253 | GMNN: Graph Markov Neural Networks<br>Meng Qu, Yoshua Bengio, Jian Tang | 👻 Ghosted | cs.LG | 314 | 6 years ago |
| 254 | Stochastic modified equations and adaptive stochastic gradient algorithms<br>Qianxiao Li, Cheng Tai, Weinan E | 👻 Ghosted | cs.LG | 314 | 10 years ago |
| 255 | PolyGen: An Autoregressive Generative Model of 3D Meshes<br>Charlie Nash, Yaroslav Ganin, ... (+2 more) | 👻 Ghosted | cs.GR | 313 | 6 years ago |
| 256 | Transformer Quality in Linear Time<br>Weizhe Hua, Zihang Dai, ... (+2 more) | 👻 Ghosted | cs.LG | 312 | 4 years ago |
| 257 | Sever: A Robust Meta-Algorithm for Stochastic Optimization<br>Ilias Diakonikolas, Gautam Kamath, ... (+4 more) | 👻 Ghosted | cs.LG | 309 | 8 years ago |
| 258 | Learning Physical Intuition of Block Towers by Example<br>Adam Lerer, Sam Gross, Rob Fergus | 👻 Ghosted | cs.AI | 309 | 10 years ago |
| 259 | Computational Optimal Transport: Complexity by Accelerated Gradient Descent Is Better Than by Sinkhorn's Algorithm<br>Pavel Dvurechensky, Alexander Gasnikov, Alexey Kroshnin | 👻 Ghosted | cs.DS | 306 | 8 years ago |
| 260 | Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks<br>Vahid Behzadan, Arslan Munir | 👻 Ghosted | cs.LG | 304 | 9 years ago |
| 261 | Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds<br>Andrea Zanette, Emma Brunskill | 👻 Ghosted | cs.LG | 302 | 7 years ago |
| 262 | Learned Optimizers that Scale and Generalize<br>Olga Wichrowska, Niru Maheswaranathan, ... (+5 more) | 👻 Ghosted | cs.LG | 302 | 9 years ago |
| 263 | The Predictron: End-To-End Learning and Planning<br>David Silver, Hado van Hasselt, ... (+9 more) | 👻 Ghosted | cs.LG | 302 | 9 years ago |
| 264 | Noisy Activation Functions<br>Caglar Gulcehre, Marcin Moczulski, ... (+2 more) | 👻 Ghosted | cs.LG | 302 | 10 years ago |
| 265 | PixelSNAIL: An Improved Autoregressive Generative Model<br>Xi Chen, Nikhil Mishra, ... (+2 more) | 🌅 Old Age | cs.LG | 300 | 8 years ago |
| 266 | Provably Efficient Exploration in Policy Optimization<br>Qi Cai, Zhuoran Yang, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 6 years ago |
| 267 | Skew-Fit: State-Covering Self-Supervised Reinforcement Learning<br>Vitchyr H. Pong, Murtaza Dalal, ... (+4 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 268 | Online Meta-Learning<br>Chelsea Finn, Aravind Rajeswaran, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 269 | Gromov-Wasserstein Learning for Graph Matching and Node Embedding<br>Hongteng Xu, Dixin Luo, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 270 | The Mechanics of n-Player Differentiable Games<br>David Balduzzi, Sebastien Racaniere, ... (+4 more) | 👻 Ghosted | cs.LG | 299 | 8 years ago |
| 271 | Unsupervised Learning by Predicting Noise<br>Piotr Bojanowski, Armand Joulin | 👻 Ghosted | stat.ML | 299 | 9 years ago |
| 272 | The case for 4-bit precision: k-bit Inference Scaling Laws<br>Tim Dettmers, Luke Zettlemoyer | 👻 Ghosted | cs.LG | 299 | 3 years ago |
| 273 | Variational inference for Monte Carlo objectives<br>Andriy Mnih, Danilo J. Rezende | 👻 Ghosted | cs.LG | 298 | 10 years ago |
| 274 | The loss surface of deep and wide neural networks<br>Quynh Nguyen, Matthias Hein | 👻 Ghosted | cs.LG | 297 | 9 years ago |
| 275 | Understanding the impact of entropy on policy optimization<br>Zafarali Ahmed, Nicolas Le Roux, ... (+2 more) | 👻 Ghosted | cs.LG | 295 | 7 years ago |
| 276 | AdaNet: Adaptive Structural Learning of Artificial Neural Networks<br>Corinna Cortes, Xavi Gonzalvo, ... (+3 more) | 👻 Ghosted | cs.LG | 295 | 9 years ago |
| 277 | Learning to Exploit Long-term Relational Dependencies in Knowledge Graphs<br>Lingbing Guo, Zequn Sun, Wei Hu | 👻 Ghosted | cs.AI | 294 | 6 years ago |
| 278 | Cascading Bandits: Learning to Rank in the Cascade Model<br>Branislav Kveton, Csaba Szepesvari, ... (+2 more) | 👻 Ghosted | cs.LG | 294 | 11 years ago |
| 279 | A Kronecker-factored approximate Fisher matrix for convolution layers<br>Roger Grosse, James Martens | 👻 Ghosted | stat.ML | 292 | 10 years ago |
| 280 | Compositional Fairness Constraints for Graph Embeddings<br>Avishek Joey Bose, William L. Hamilton | 👻 Ghosted | cs.LG | 291 | 6 years ago |
| 281 | Efficient softmax approximation for GPUs<br>Edouard Grave, Armand Joulin, ... (+3 more) | 🌅 Old Age | cs.CL | 290 | 9 years ago |
| 282 | Training Neural Networks Without Gradients: A Scalable ADMM Approach<br>Gavin Taylor, Ryan Burmeister, ... (+4 more) | 👻 Ghosted | cs.LG | 289 | 9 years ago |
| 283 | Analyzing Uncertainty in Neural Machine Translation<br>Myle Ott, Michael Auli, ... (+2 more) | 👻 Ghosted | cs.CL | 288 | 8 years ago |
| 284 | Poisoning Language Models During Instruction Tuning<br>Alexander Wan, Eric Wallace, ... (+2 more) | 👻 Ghosted | cs.CL | 285 | 2 years ago |
| 285 | Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam<br>Mohammad Emtiyaz Khan, Didrik Nielsen, ... (+4 more) | 👻 Ghosted | stat.ML | 284 | 7 years ago |
| 286 | On the Power of Over-parametrization in Neural Networks with Quadratic Activation<br>Simon S. Du, Jason D. Lee | 👻 Ghosted | cs.LG | 284 | 8 years ago |
| 287 | Rademacher Complexity for Adversarially Robust Generalization<br>Dong Yin, Kannan Ramchandran, Peter Bartlett | 👻 Ghosted | cs.LG | 282 | 7 years ago |
| 288 | Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning<br>Junhyuk Oh, Satinder Singh, ... (+2 more) | 👻 Ghosted | cs.AI | 282 | 8 years ago |
| 289 | Equivariance Through Parameter-Sharing<br>Siamak Ravanbakhsh, Jeff Schneider, Barnabas Poczos | 👻 Ghosted | stat.ML | 281 | 9 years ago |
| 290 | Graying the black box: Understanding DQNs<br>Tom Zahavy, Nir Ben Zrihem, Shie Mannor | 👻 Ghosted | cs.LG | 281 | 10 years ago |
| 291 | More Robust Doubly Robust Off-policy Evaluation<br>Mehrdad Farajtabar, Yinlam Chow, Mohammad Ghavamzadeh | 👻 Ghosted | cs.AI | 280 | 8 years ago |
| 292 | Self-Imitation Learning<br>Junhyuk Oh, Yijie Guo, ... (+2 more) | 👻 Ghosted | cs.LG | 279 | 7 years ago |
| 293 | Spurious Local Minima are Common in Two-Layer ReLU Neural Networks<br>Itay Safran, Ohad Shamir | 👻 Ghosted | cs.LG | 279 | 8 years ago |
| 294 | Bounding and Counting Linear Regions of Deep Neural Networks<br>Thiago Serra, Christian Tjandraatmadja, Srikumar Ramalingam | 👻 Ghosted | cs.LG | 278 | 8 years ago |
| 295 | A Laplacian Framework for Option Discovery in Reinforcement Learning<br>Marlos C. Machado, Marc G. Bellemare, Michael Bowling | 👻 Ghosted | cs.LG | 278 | 9 years ago |
| 296 | Why is Posterior Sampling Better than Optimism for Reinforcement Learning?<br>Ian Osband, Benjamin Van Roy | 👻 Ghosted | stat.ML | 278 | 9 years ago |
| 297 | Memory-Efficient Pipeline-Parallel DNN Training<br>Deepak Narayanan, Amar Phanishayee, ... (+3 more) | 👻 Ghosted | cs.LG | 274 | 5 years ago |
| 298 | Online and Linear-Time Attention by Enforcing Monotonic Alignments<br>Colin Raffel, Minh-Thang Luong, ... (+3 more) | 👻 Ghosted | cs.LG | 273 | 9 years ago |
| 299 | Low Latency Privacy Preserving Inference<br>Alon Brutzkus, Oren Elisha, Ran Gilad-Bachrach | 👻 Ghosted | cs.LG | 272 | 7 years ago |
| 300 | Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks<br>Juho Lee, Yoonho Lee, ... (+4 more) | 👻 Ghosted | cs.LG | 270 | 7 years ago |