💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 2, showing 50 papers

# Paper Cause of Death Category Citations Published
51 GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks
Zhao Chen, Vijay Badrinarayanan, ... (+2 more)
👻 Ghosted cs.CV 1.7K 8 years ago
52 Constrained Policy Optimization
Joshua Achiam, David Held, ... (+2 more)
👻 Ghosted cs.LG 1.6K 8 years ago
53 Which Training Methods for GANs do actually Converge?
Lars Mescheder, Andreas Geiger, Sebastian Nowozin
👻 Ghosted cs.LG 1.6K 8 years ago
54 MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels
Lu Jiang, Zhengyuan Zhou, ... (+3 more)
🌅 Old Age cs.CV 1.6K 8 years ago
55 A Convergence Theory for Deep Learning via Over-Parameterization
Zeyuan Allen-Zhu, Yuanzhi Li, Zhao Song
👻 Ghosted cs.LG 1.6K 7 years ago
56 Junction Tree Variational Autoencoder for Molecular Graph Generation
Wengong Jin, Regina Barzilay, Tommi Jaakkola
👻 Ghosted cs.LG 1.6K 8 years ago
57 Large-Margin Softmax Loss for Convolutional Neural Networks
Weiyang Liu, Yandong Wen, ... (+2 more)
👻 Ghosted stat.ML 1.6K 9 years ago
58 Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja, Haoran Tang, ... (+2 more)
👻 Ghosted cs.LG 1.5K 9 years ago
59 Train faster, generalize better: Stability of stochastic gradient descent
Moritz Hardt, Benjamin Recht, Yoram Singer
👻 Ghosted cs.LG 1.4K 10 years ago
60 Black-box Adversarial Attacks with Limited Queries and Information
Andrew Ilyas, Logan Engstrom, ... (+2 more)
👻 Ghosted cs.CV 1.3K 8 years ago
61 Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan, Matan Kalman, Yossi Matias
👻 Ghosted cs.LG 1.2K 3 years ago
62 Compressing Neural Networks with the Hashing Trick
Wenlin Chen, James T. Wilson, ... (+3 more)
👻 Ghosted cs.LG 1.2K 11 years ago
63 Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Noam Shazeer, Mitchell Stern
👻 Ghosted cs.LG 1.2K 8 years ago
64 Overcoming catastrophic forgetting with hard attention to the task
Joan Serrà, Dídac Surís, ... (+2 more)
👻 Ghosted cs.LG 1.2K 8 years ago
65 signSGD: Compressed Optimisation for Non-Convex Problems
Jeremy Bernstein, Yu-Xiang Wang, ... (+2 more)
🌅 Old Age cs.LG 1.2K 8 years ago
66 Optimizing Neural Networks with Kronecker-factored Approximate Curvature
James Martens, Roger Grosse
👻 Ghosted cs.LG 1.2K 11 years ago
67 Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Ankit Kumar, Ozan Irsoy, ... (+7 more)
👻 Ghosted cs.CL 1.2K 10 years ago
68 Gradient Descent Finds Global Minima of Deep Neural Networks
Simon S. Du, Jason D. Lee, ... (+3 more)
👻 Ghosted cs.LG 1.2K 7 years ago
69 Ditto: Fair and Robust Federated Learning Through Personalization
Tian Li, Shengyuan Hu, ... (+2 more)
👻 Ghosted cs.LG 1.2K 5 years ago
70 Born Again Neural Networks
Tommaso Furlanello, Zachary C. Lipton, ... (+3 more)
👻 Ghosted stat.ML 1.2K 7 years ago
71 Meta Networks
Tsendsuren Munkhdalai, Hong Yu
👻 Ghosted cs.LG 1.1K 9 years ago
72 OptNet: Differentiable Optimization as a Layer in Neural Networks
Brandon Amos, J. Zico Kolter
👻 Ghosted cs.LG 1.1K 9 years ago
73 Out-of-Distribution Generalization via Risk Extrapolation (REx)
David Krueger, Ethan Caballero, ... (+6 more)
👻 Ghosted cs.LG 1.1K 6 years ago
74 MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing
Sami Abu-El-Haija, Bryan Perozzi, ... (+6 more)
👻 Ghosted cs.LG 1.1K 6 years ago
75 Continuous Deep Q-Learning with Model-based Acceleration
Shixiang Gu, Timothy Lillicrap, ... (+2 more)
👻 Ghosted cs.LG 1.1K 10 years ago
76 Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks
Sanjeev Arora, Simon S. Du, ... (+3 more)
👻 Ghosted cs.LG 1.0K 7 years ago
77 Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn, Sergey Levine, Pieter Abbeel
👻 Ghosted cs.LG 1.0K 10 years ago
78 Toward Controlled Generation of Text
Zhiting Hu, Zichao Yang, ... (+3 more)
🌅 Old Age cs.LG 1.0K 9 years ago
79 Gradient-based Hyperparameter Optimization through Reversible Learning
Dougal Maclaurin, David Duvenaud, Ryan P. Adams
👻 Ghosted stat.ML 1.0K 11 years ago
80 MASS: Masked Sequence to Sequence Pre-training for Language Generation
Kaitao Song, Xu Tan, ... (+3 more)
👻 Ghosted cs.CL 1.0K 6 years ago
81 On Deep Multi-View Representation Learning: Objectives and Optimization
Weiran Wang, Raman Arora, ... (+2 more)
👻 Ghosted cs.LG 1.0K 10 years ago
82 FeUdal Networks for Hierarchical Reinforcement Learning
Alexander Sasha Vezhnevets, Simon Osindero, ... (+5 more)
👻 Ghosted cs.AI 1.0K 9 years ago
83 Data Shapley: Equitable Valuation of Data for Machine Learning
Amirata Ghorbani, James Zou
👻 Ghosted stat.ML 995 7 years ago
84 Texture Networks: Feed-forward Synthesis of Textures and Stylized Images
Dmitry Ulyanov, Vadim Lebedev, ... (+2 more)
👻 Ghosted cs.CV 994 10 years ago
85 Robust Adversarial Reinforcement Learning
Lerrel Pinto, James Davidson, ... (+2 more)
👻 Ghosted cs.LG 985 9 years ago
86 Implicit Geometric Regularization for Learning Shapes
Amos Gropp, Lior Yariv, ... (+3 more)
👻 Ghosted cs.LG 978 6 years ago
87 QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son, Daewoo Kim, ... (+3 more)
👻 Ghosted cs.LG 974 6 years ago
88 Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering
Bo Yang, Xiao Fu, ... (+2 more)
👻 Ghosted cs.LG 963 9 years ago
89 GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models
Jiaxuan You, Rex Ying, ... (+3 more)
👻 Ghosted cs.LG 961 8 years ago
90 The Hidden Vulnerability of Distributed Learning in Byzantium
El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault
👻 Ghosted stat.ML 940 8 years ago
91 MADE: Masked Autoencoder for Distribution Estimation
Mathieu Germain, Karol Gregor, ... (+2 more)
👻 Ghosted cs.LG 925 11 years ago
92 How to Escape Saddle Points Efficiently
Chi Jin, Rong Ge, ... (+3 more)
👻 Ghosted cs.LG 907 9 years ago
93 Generative Moment Matching Networks
Yujia Li, Kevin Swersky, Richard Zemel
👻 Ghosted cs.LG 899 11 years ago
94 Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal, Fei Sha
👻 Ghosted cs.LG 896 7 years ago
95 Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang, Daisy Stanton, ... (+8 more)
👻 Ghosted cs.CL 893 8 years ago
96 Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord, Yazhe Li, ... (+20 more)
👻 Ghosted cs.LG 893 8 years ago
97 Compressed Sensing using Generative Models
Ashish Bora, Ajil Jalal, ... (+2 more)
👻 Ghosted stat.ML 893 9 years ago
98 Gated Feedback Recurrent Neural Networks
Junyoung Chung, Caglar Gulcehre, ... (+2 more)
👻 Ghosted cs.NE 884 11 years ago
99 Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness
Michael Kearns, Seth Neel, ... (+2 more)
👻 Ghosted cs.LG 866 8 years ago
100 A Theoretical Analysis of Contrastive Unsupervised Representation Learning
Sanjeev Arora, Hrishikesh Khandeparkar, ... (+3 more)
👻 Ghosted cs.LG 859 7 years ago