| 51 |
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo, Aniruddha Kembhavi, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.1K |
9 years ago |
| 52 |
Deep Variational Information Bottleneck
Alexander A. Alemi, Ian Fischer, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
2.0K |
9 years ago |
| 53 |
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Han Cai, Ligeng Zhu, Song Han
|
👻
Ghosted
|
cs.LG
|
2.0K |
7 years ago |
| 54 |
Deep multi-scale video prediction beyond mean square error
Michael Mathieu, Camille Couprie, Yann LeCun
|
👻
Ghosted
|
cs.LG
|
2.0K |
10 years ago |
| 55 |
Predict then Propagate: Graph Neural Networks meet Personalized PageRank
Johannes Gasteiger, Aleksandar Bojchevski, Stephan Günnemann
|
👻
Ghosted
|
cs.LG
|
1.9K |
7 years ago |
| 56 |
End-to-end Optimized Image Compression
Johannes Ballé, Valero Laparra, Eero P. Simoncelli
|
👻
Ghosted
|
cs.CV
|
1.9K |
9 years ago |
| 57 |
Robustness May Be at Odds with Accuracy
Dimitris Tsipras, Shibani Santurkar, ... (+3 more)
|
👻
Ghosted
|
stat.ML
|
1.9K |
7 years ago |
| 58 |
Delving into Transferable Adversarial Examples and Black-box Attacks
Yanpei Liu, Xinyun Chen, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.9K |
9 years ago |
| 59 |
Adversarial Feature Learning
Jeff Donahue, Philipp Krähenbühl, Trevor Darrell
|
👻
Ghosted
|
cs.LG
|
1.9K |
9 years ago |
| 60 |
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer, Adam Polyak, ... (+11 more)
|
👻
Ghosted
|
cs.CV
|
1.8K |
3 years ago |
| 61 |
HyperNetworks
David Ha, Andrew Dai, Quoc V. Le
|
👻
Ghosted
|
cs.LG
|
1.8K |
9 years ago |
| 62 |
Adaptive Federated Optimization
Sashank Reddi, Zachary Charles, ... (+6 more)
|
👻
Ghosted
|
cs.LG
|
1.8K |
6 years ago |
| 63 |
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong, Wei Ping, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
1.8K |
5 years ago |
| 64 |
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su, Xizhou Zhu, ... (+5 more)
|
💀
404 Not Found
|
cs.CV
|
1.8K |
6 years ago |
| 65 |
Word Translation Without Parallel Data
Alexis Conneau, Guillaume Lample, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.7K |
8 years ago |
| 66 |
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin, HyoukJoong Lee, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
1.7K |
5 years ago |
| 67 |
Sequence Level Training with Recurrent Neural Networks
Marc'Aurelio Ranzato, Sumit Chopra, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.7K |
10 years ago |
| 68 |
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello, Hieu Pham, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
1.7K |
9 years ago |
| 69 |
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner, Timothy Lillicrap, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.7K |
6 years ago |
| 70 |
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren, Chenxu Hu, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
1.7K |
5 years ago |
| 71 |
Efficient Lifelong Learning with A-GEM
Arslan Chaudhry, Marc'Aurelio Ranzato, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.7K |
7 years ago |
| 72 |
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus, Caiming Xiong, Richard Socher
|
👻
Ghosted
|
cs.CL
|
1.6K |
8 years ago |
| 73 |
Rethinking the Value of Network Pruning
Zhuang Liu, Mingjie Sun, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
1.6K |
7 years ago |
| 74 |
DropEdge: Towards Deep Graph Convolutional Networks on Node Classification
Yu Rong, Wenbing Huang, ... (+2 more)
|
💀
404 Not Found
|
cs.LG
|
1.6K |
6 years ago |
| 75 |
Exploration by Random Network Distillation
Yuri Burda, Harrison Edwards, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.6K |
7 years ago |
| 76 |
Countering Adversarial Images using Input Transformations
Chuan Guo, Mayank Rana, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
1.5K |
8 years ago |
| 77 |
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo, Shuo Ren, ... (+16 more)
|
👻
Ghosted
|
cs.SE
|
1.5K |
5 years ago |
| 78 |
Designing Neural Network Architectures using Reinforcement Learning
Bowen Baker, Otkrist Gupta, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.5K |
9 years ago |
| 79 |
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
Dan Hendrycks, Norman Mu, ... (+4 more)
|
🌅
Old Age
|
stat.ML
|
1.5K |
6 years ago |
| 80 |
Meta-Learning with Latent Embedding Optimization
Andrei A. Rusu, Dushyant Rao, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
1.5K |
7 years ago |
| 81 |
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
Lee Xiong, Chenyan Xiong, ... (+6 more)
|
👻
Ghosted
|
cs.IR
|
1.5K |
5 years ago |
| 82 |
Decoupling Representation and Classifier for Long-Tailed Recognition
Bingyi Kang, Saining Xie, ... (+5 more)
|
🌅
Old Age
|
cs.CV
|
1.4K |
6 years ago |
| 83 |
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu, Suyog Gupta
|
👻
Ghosted
|
stat.ML
|
1.4K |
8 years ago |
| 84 |
SNIP: Single-shot Network Pruning based on Connection Sensitivity
Namhoon Lee, Thalaiyasingam Ajanthan, Philip H. S. Torr
|
👻
Ghosted
|
cs.CV
|
1.4K |
7 years ago |
| 85 |
Lifelong Learning with Dynamically Expandable Networks
Jaehong Yoon, Eunho Yang, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.4K |
8 years ago |
| 86 |
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
Simon S. Du, Xiyu Zhai, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.3K |
7 years ago |
| 87 |
Adversarially Learned Inference
Vincent Dumoulin, Ishmael Belghazi, ... (+5 more)
|
👻
Ghosted
|
stat.ML
|
1.3K |
9 years ago |
| 88 |
Few-Shot Learning with Graph Neural Networks
Victor Garcia, Joan Bruna
|
👻
Ghosted
|
stat.ML
|
1.3K |
8 years ago |
| 89 |
Importance Weighted Autoencoders
Yuri Burda, Roger Grosse, Ruslan Salakhutdinov
|
👻
Ghosted
|
cs.LG
|
1.3K |
10 years ago |
| 90 |
Deep Biaffine Attention for Neural Dependency Parsing
Timothy Dozat, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
1.3K |
9 years ago |
| 91 |
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg, Volodymyr Mnih, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
1.3K |
9 years ago |
| 92 |
Learning Sparse Neural Networks through $L_0$ Regularization
Christos Louizos, Max Welling, Diederik P. Kingma
|
👻
Ghosted
|
stat.ML
|
1.3K |
8 years ago |
| 93 |
A Learned Representation For Artistic Style
Vincent Dumoulin, Jonathon Shlens, Manjunath Kudlur
|
👻
Ghosted
|
cs.CV
|
1.2K |
9 years ago |
| 94 |
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
Jason Weston, Antoine Bordes, ... (+5 more)
|
👻
Ghosted
|
cs.AI
|
1.2K |
11 years ago |
| 95 |
Understanding intermediate layers using linear classifier probes
Guillaume Alain, Yoshua Bengio
|
👻
Ghosted
|
stat.ML
|
1.2K |
9 years ago |
| 96 |
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra, George Tucker, ... (+3 more)
|
👻
Ghosted
|
cs.NE
|
1.2K |
9 years ago |
| 97 |
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach, Abhishek Gupta, ... (+2 more)
|
👻
Ghosted
|
cs.AI
|
1.2K |
8 years ago |
| 98 |
Large Language Models Are Human-Level Prompt Engineers
Yongchao Zhou, Andrei Ioan Muresanu, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
1.2K |
3 years ago |
| 99 |
Local SGD Converges Fast and Communicates Little
Sebastian U. Stich
|
👻
Ghosted
|
math.OC
|
1.2K |
7 years ago |
| 100 |
Deep Neural Networks as Gaussian Processes
Jaehoon Lee, Yasaman Bahri, ... (+4 more)
|
👻
Ghosted
|
stat.ML
|
1.2K |
8 years ago |