| 251 | Abductive Commonsense Reasoning<br>Chandra Bhagavatula, Ronan Le Bras, ... (+7 more) | 👻 Ghosted | cs.CL | 499 | 6 years ago |
| 252 | Phenaki: Variable Length Video Generation From Open Domain Textual Description<br>Ruben Villegas, Mohammad Babaeizadeh, ... (+7 more) | 👻 Ghosted | cs.CV | 499 | 3 years ago |
| 253 | A Fair Comparison of Graph Neural Networks for Graph Classification<br>Federico Errica, Marco Podda, ... (+2 more) | 👻 Ghosted | cs.LG | 498 | 6 years ago |
| 254 | Measuring the Intrinsic Dimension of Objective Landscapes<br>Chunyuan Li, Heerad Farkhoor, ... (+2 more) | 👻 Ghosted | cs.LG | 496 | 8 years ago |
| 255 | Unifying distillation and privileged information<br>David Lopez-Paz, Léon Bottou, ... (+2 more) | 👻 Ghosted | stat.ML | 496 | 10 years ago |
| 256 | Eureka: Human-Level Reward Design via Coding Large Language Models<br>Yecheng Jason Ma, William Liang, ... (+7 more) | 👻 Ghosted | cs.RO | 495 | 2 years ago |
| 257 | FreeLB: Enhanced Adversarial Training for Natural Language Understanding<br>Chen Zhu, Yu Cheng, ... (+4 more) | 🌅 Old Age | cs.CL | 493 | 6 years ago |
| 258 | Revisiting Few-sample BERT Fine-tuning<br>Tianyi Zhang, Felix Wu, ... (+3 more) | 🌅 Old Age | cs.CL | 491 | 5 years ago |
| 259 | Convolutional neural networks with low-rank regularization<br>Cheng Tai, Tong Xiao, ... (+3 more) | 👻 Ghosted | cs.LG | 489 | 10 years ago |
| 260 | MaskGAN: Better Text Generation via Filling in the______<br>William Fedus, Ian Goodfellow, Andrew M. Dai | 👻 Ghosted | stat.ML | 488 | 8 years ago |
| 261 | Sensitivity and Generalization in Neural Networks: an Empirical Study<br>Roman Novak, Yasaman Bahri, ... (+3 more) | 👻 Ghosted | stat.ML | 486 | 8 years ago |
| 262 | DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models<br>Shansan Gong, Mukai Li, ... (+3 more) | 💀 404 Not Found | cs.CL | 486 | 3 years ago |
| 263 | Generating Images from Captions with Attention<br>Elman Mansimov, Emilio Parisotto, ... (+2 more) | 👻 Ghosted | cs.LG | 484 | 10 years ago |
| 264 | Structured Attention Networks<br>Yoon Kim, Carl Denton, ... (+2 more) | 👻 Ghosted | cs.CL | 483 | 9 years ago |
| 265 | Model-Ensemble Trust-Region Policy Optimization<br>Thanard Kurutach, Ignasi Clavera, ... (+3 more) | 👻 Ghosted | cs.LG | 477 | 8 years ago |
| 266 | Learning a SAT Solver from Single-Bit Supervision<br>Daniel Selsam, Matthew Lamm, ... (+4 more) | 👻 Ghosted | cs.AI | 477 | 8 years ago |
| 267 | Neural Photo Editing with Introspective Adversarial Networks<br>Andrew Brock, Theodore Lim, ... (+2 more) | 👻 Ghosted | cs.LG | 475 | 9 years ago |
| 268 | CodeT: Code Generation with Generated Tests<br>Bei Chen, Fengji Zhang, ... (+5 more) | 👻 Ghosted | cs.CL | 470 | 3 years ago |
| 269 | Quasi-Recurrent Neural Networks<br>James Bradbury, Stephen Merity, ... (+2 more) | 👻 Ghosted | cs.NE | 465 | 9 years ago |
| 270 | Multi-Agent Cooperation and the Emergence of (Natural) Language<br>Angeliki Lazaridou, Alexander Peysakhovich, Marco Baroni | 👻 Ghosted | cs.CL | 464 | 9 years ago |
| 271 | Learn To Pay Attention<br>Saumya Jetley, Nicholas A. Lord, ... (+2 more) | 👻 Ghosted | cs.CV | 463 | 8 years ago |
| 272 | Distributional Smoothing with Virtual Adversarial Training<br>Takeru Miyato, Shin-ichi Maeda, ... (+3 more) | 👻 Ghosted | stat.ML | 463 | 10 years ago |
| 273 | Dynamics-Aware Unsupervised Discovery of Skills<br>Archit Sharma, Shixiang Gu, ... (+3 more) | 👻 Ghosted | cs.LG | 461 | 6 years ago |
| 274 | Empirical Analysis of the Hessian of Over-Parametrized Neural Networks<br>Levent Sagun, Utku Evci, ... (+3 more) | 👻 Ghosted | cs.LG | 460 | 8 years ago |
| 275 | Variational Intrinsic Control<br>Karol Gregor, Danilo Jimenez Rezende, Daan Wierstra | 👻 Ghosted | cs.LG | 459 | 9 years ago |
| 276 | Deep Learning for Symbolic Mathematics<br>Guillaume Lample, François Charton | 👻 Ghosted | cs.SC | 458 | 6 years ago |
| 277 | Don't Use Large Mini-Batches, Use Local SGD<br>Tao Lin, Sebastian U. Stich, ... (+2 more) | 👻 Ghosted | cs.LG | 457 | 7 years ago |
| 278 | A Compositional Object-Based Approach to Learning Physical Dynamics<br>Michael B. Chang, Tomer Ullman, ... (+2 more) | 👻 Ghosted | cs.AI | 454 | 9 years ago |
| 279 | Dataset Augmentation in Feature Space<br>Terrance DeVries, Graham W. Taylor | 👻 Ghosted | stat.ML | 452 | 9 years ago |
| 280 | Are Transformers universal approximators of sequence-to-sequence functions?<br>Chulhee Yun, Srinadh Bhojanapalli, ... (+3 more) | 👻 Ghosted | cs.LG | 451 | 6 years ago |
| 281 | DiffTaichi: Differentiable Programming for Physical Simulation<br>Yuanming Hu, Luke Anderson, ... (+5 more) | 👻 Ghosted | cs.LG | 449 | 6 years ago |
| 282 | Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning<br>Michael Lutter, Christian Ritter, Jan Peters | 👻 Ghosted | cs.LG | 446 | 6 years ago |
| 283 | Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning<br>Zeyuan Allen-Zhu, Yuanzhi Li | 👻 Ghosted | cs.LG | 445 | 5 years ago |
| 284 | Amortised MAP Inference for Image Super-resolution<br>Casper Kaae Sønderby, Jose Caballero, ... (+3 more) | 👻 Ghosted | cs.CV | 445 | 9 years ago |
| 285 | Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids<br>Yunzhu Li, Jiajun Wu, ... (+3 more) | 🌅 Old Age | cs.LG | 444 | 7 years ago |
| 286 | Reducing Overfitting in Deep Networks by Decorrelating Representations<br>Michael Cogswell, Faruk Ahmed, ... (+3 more) | 👻 Ghosted | cs.LG | 443 | 10 years ago |
| 287 | Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware<br>Florian Tramèr, Dan Boneh | 👻 Ghosted | stat.ML | 442 | 7 years ago |
| 288 | Large scale distributed neural network training through online distillation<br>Rohan Anil, Gabriel Pereyra, ... (+4 more) | 👻 Ghosted | cs.LG | 441 | 8 years ago |
| 289 | Towards a Neural Statistician<br>Harrison Edwards, Amos Storkey | 👻 Ghosted | stat.ML | 439 | 9 years ago |
| 290 | Soft Weight-Sharing for Neural Network Compression<br>Karen Ullrich, Edward Meeds, Max Welling | 👻 Ghosted | stat.ML | 438 | 9 years ago |
| 291 | Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation<br>Yangsibo Huang, Samyak Gupta, ... (+3 more) | 💤 Eternal Rest | cs.CL | 435 | 2 years ago |
| 292 | Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought<br>Abulhair Saparov, He He | 👻 Ghosted | cs.CL | 435 | 3 years ago |
| 293 | Adaptive Input Representations for Neural Language Modeling<br>Alexei Baevski, Michael Auli | 👻 Ghosted | cs.CL | 426 | 7 years ago |
| 294 | Density Modeling of Images using a Generalized Normalization Transformation<br>Johannes Ballé, Valero Laparra, Eero P. Simoncelli | 👻 Ghosted | cs.LG | 425 | 10 years ago |
| 295 | The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"<br>Lukas Berglund, Meg Tong, ... (+5 more) | 💤 Eternal Rest | cs.CL | 425 | 2 years ago |
| 296 | FigureQA: An Annotated Figure Dataset for Visual Reasoning<br>Samira Ebrahimi Kahou, Vincent Michalski, ... (+4 more) | 👻 Ghosted | cs.CV | 421 | 8 years ago |
| 297 | Neural Programmer-Interpreters<br>Scott Reed, Nando de Freitas | 👻 Ghosted | cs.LG | 420 | 10 years ago |
| 298 | Emergent Complexity via Multi-Agent Competition<br>Trapit Bansal, Jakub Pachocki, ... (+3 more) | 👻 Ghosted | cs.AI | 418 | 8 years ago |
| 299 | Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning<br>Ting Chen, Ruixiang Zhang, Geoffrey Hinton | 👻 Ghosted | cs.CV | 416 | 3 years ago |
| 300 | Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients<br>Brenden K. Petersen, Mikel Landajuela, ... (+4 more) | 👻 Ghosted | cs.LG | 414 | 6 years ago |