| 101 |
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri, Andrea Madotto, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 102 |
Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data
Nicolas Papernot, Martín Abadi, ... (+3 more)
|
👻
Ghosted
|
stat.ML
|
1.1K |
9 years ago |
| 103 |
Don't Decay the Learning Rate, Increase the Batch Size
Samuel L. Smith, Pieter-Jan Kindermans, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
8 years ago |
| 104 |
Mastering Atari with Discrete World Models
Danijar Hafner, Timothy Lillicrap, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
5 years ago |
| 105 |
Trained Ternary Quantization
Chenzhuo Zhu, Song Han, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
9 years ago |
| 106 |
Deep Double Descent: Where Bigger Models and More Data Hurt
Preetum Nakkiran, Gal Kaplun, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
6 years ago |
| 107 |
Unrolled Generative Adversarial Networks
Luke Metz, Ben Poole, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
9 years ago |
| 108 |
Composition-based Multi-Relational Graph Convolutional Networks
Shikhar Vashishth, Soumya Sanyal, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
1.1K |
6 years ago |
| 109 |
Snapshot Ensembles: Train 1, get M for free
Gao Huang, Yixuan Li, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
1.0K |
9 years ago |
| 110 |
Unsupervised Cross-Domain Image Generation
Yaniv Taigman, Adam Polyak, Lior Wolf
|
👻
Ghosted
|
cs.CV
|
1.0K |
9 years ago |
| 111 |
FractalNet: Ultra-Deep Neural Networks without Residuals
Gustav Larsson, Michael Maire, Gregory Shakhnarovich
|
👻
Ghosted
|
cs.CV
|
1.0K |
9 years ago |
| 112 |
On Detecting Adversarial Perturbations
Jan Hendrik Metzen, Tim Genewein, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
1.0K |
9 years ago |
| 113 |
Particular object retrieval with integral max-pooling of CNN activations
Giorgos Tolias, Ronan Sicre, Hervé Jégou
|
👻
Ghosted
|
cs.CV
|
1.0K |
10 years ago |
| 114 |
Order Matters: Sequence to sequence for sets
Oriol Vinyals, Samy Bengio, Manjunath Kudlur
|
👻
Ghosted
|
stat.ML
|
1.0K |
10 years ago |
| 115 |
Wizard of Wikipedia: Knowledge-Powered Conversational agents
Emily Dinan, Stephen Roller, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
7 years ago |
| 116 |
Generalization through Memorization: Nearest Neighbor Language Models
Urvashi Khandelwal, Omer Levy, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
997 |
6 years ago |
| 117 |
FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models
Will Grathwohl, Ricky T. Q. Chen, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
992 |
7 years ago |
| 118 |
Noisy Networks for Exploration
Meire Fortunato, Mohammad Gheshlaghi Azar, ... (+10 more)
|
👻
Ghosted
|
cs.LG
|
981 |
8 years ago |
| 119 |
InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization
Fan-Yun Sun, Jordan Hoffmann, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
979 |
6 years ago |
| 120 |
Deep Complex Networks
Chiheb Trabelsi, Olexa Bilaniuk, ... (+8 more)
|
👻
Ghosted
|
cs.NE
|
960 |
8 years ago |
| 121 |
Hierarchical Representations for Efficient Architecture Search
Hanxiao Liu, Karen Simonyan, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
949 |
8 years ago |
| 122 |
A Neural Representation of Sketch Drawings
David Ha, Douglas Eck
|
👻
Ghosted
|
cs.NE
|
948 |
9 years ago |
| 123 |
Learning to Navigate in Complex Environments
Piotr Mirowski, Razvan Pascanu, ... (+10 more)
|
👻
Ghosted
|
cs.AI
|
935 |
9 years ago |
| 124 |
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Yong-Deok Kim, Eunhyeok Park, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
933 |
10 years ago |
| 125 |
Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples
Kimin Lee, Honglak Lee, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
928 |
8 years ago |
| 126 |
Certifying Some Distributional Robustness with Principled Adversarial Training
Aman Sinha, Hongseok Namkoong, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
927 |
8 years ago |
| 127 |
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen, Xiao Wang, ... (+27 more)
|
👻
Ghosted
|
cs.CV
|
926 |
3 years ago |
| 128 |
What do you learn from context? Probing for sentence structure in contextualized word representations
Ian Tenney, Patrick Xia, ... (+9 more)
|
👻
Ghosted
|
cs.CL
|
925 |
6 years ago |
| 129 |
Adversarial Attacks on Neural Network Policies
Sandy Huang, Nicolas Papernot, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
916 |
9 years ago |
| 130 |
An Empirical Study of Example Forgetting during Deep Neural Network Learning
Mariya Toneva, Alessandro Sordoni, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
893 |
7 years ago |
| 131 |
Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference
Matthew Riemer, Ignacio Cases, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
883 |
7 years ago |
| 132 |
Non-Autoregressive Neural Machine Translation
Jiatao Gu, James Bradbury, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
875 |
8 years ago |
| 133 |
Learning to Represent Programs with Graphs
Miltiadis Allamanis, Marc Brockschmidt, Mahmoud Khademi
|
👻
Ghosted
|
cs.LG
|
874 |
8 years ago |
| 134 |
How to train your MAML
Antreas Antoniou, Harrison Edwards, Amos Storkey
|
👻
Ghosted
|
cs.LG
|
862 |
7 years ago |
| 135 |
Universal Transformers
Mostafa Dehghani, Stephan Gouws, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
858 |
7 years ago |
| 136 |
Self-labelling via simultaneous clustering and representation learning
Yuki Markus Asano, Christian Rupprecht, Andrea Vedaldi
|
👻
Ghosted
|
cs.CV
|
858 |
6 years ago |
| 137 |
Generating Wikipedia by Summarizing Long Sequences
Peter J. Liu, Mohammad Saleh, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
854 |
8 years ago |
| 138 |
Entropy-SGD: Biasing Gradient Descent Into Wide Valleys
Pratik Chaudhari, Anna Choromanska, ... (+7 more)
|
👻
Ghosted
|
cs.LG
|
847 |
9 years ago |
| 139 |
Do Deep Generative Models Know What They Don't Know?
Eric Nalisnick, Akihiro Matsukawa, ... (+3 more)
|
👻
Ghosted
|
stat.ML
|
834 |
7 years ago |
| 140 |
Multi-task Sequence to Sequence Learning
Minh-Thang Luong, Quoc V. Le, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
833 |
10 years ago |
| 141 |
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small
Kevin Wang, Alexandre Variengien, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
830 |
3 years ago |
| 142 |
PixelDefend: Leveraging Generative Models to Understand and Defend against Adversarial Examples
Yang Song, Taesup Kim, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
827 |
8 years ago |
| 143 |
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu, Katie Luo, Sergey Levine
|
👻
Ghosted
|
cs.LG
|
824 |
8 years ago |
| 144 |
Editing Models with Task Arithmetic
Gabriel Ilharco, Marco Tulio Ribeiro, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
821 |
3 years ago |
| 145 |
InCoder: A Generative Model for Code Infilling and Synthesis
Daniel Fried, Armen Aghajanyan, ... (+8 more)
|
👻
Ghosted
|
cs.SE
|
820 |
4 years ago |
| 146 |
Model compression via distillation and quantization
Antonio Polino, Razvan Pascanu, Dan Alistarh
|
👻
Ghosted
|
cs.NE
|
809 |
8 years ago |
| 147 |
Unsupervised Neural Machine Translation
Mikel Artetxe, Gorka Labaka, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
806 |
8 years ago |
| 148 |
Characterizing Adversarial Subspaces Using Local Intrinsic Dimensionality
Xingjun Ma, Bo Li, ... (+7 more)
|
👻
Ghosted
|
cs.LG
|
804 |
8 years ago |
| 149 |
Sample Efficient Actor-Critic with Experience Replay
Ziyu Wang, Victor Bapst, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
803 |
9 years ago |
| 150 |
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae, Anna Potapenko, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
796 |
6 years ago |