| 401 |
Adversarial Dropout Regularization
Kuniaki Saito, Yoshitaka Ushiku, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
295 |
8 years ago |
| 402 |
Dataset Meta-Learning from Kernel Ridge-Regression
Timothy Nguyen, Zhourong Chen, Jaehoon Lee
|
👻
Ghosted
|
cs.LG
|
294 |
5 years ago |
| 403 |
Projection-Based Constrained Policy Optimization
Tsung-Yen Yang, Justinian Rosca, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
294 |
5 years ago |
| 404 |
Adversarial Manipulation of Deep Representations
Sara Sabour, Yanshuai Cao, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
294 |
10 years ago |
| 405 |
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
Wei Wang, Bin Bi, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
292 |
6 years ago |
| 406 |
Are adversarial examples inevitable?
Ali Shafahi, W. Ronny Huang, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
292 |
7 years ago |
| 407 |
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Evan Zheran Liu, Kelvin Guu, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
292 |
8 years ago |
| 408 |
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh, Iuri Frosio, ... (+3 more)
|
🌅
Old Age
|
cs.LG
|
291 |
9 years ago |
| 409 |
Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning
Ruqi Zhang, Chunyuan Li, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
290 |
7 years ago |
| 410 |
Hierarchical Generative Modeling for Controllable Speech Synthesis
Wei-Ning Hsu, Yu Zhang, ... (+10 more)
|
👻
Ghosted
|
cs.CL
|
290 |
7 years ago |
| 411 |
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong, Chunrui Han, ... (+12 more)
|
👻
Ghosted
|
cs.CV
|
290 |
2 years ago |
| 412 |
Revisiting Self-Training for Neural Sequence Generation
Junxian He, Jiatao Gu, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
289 |
6 years ago |
| 413 |
Learning to Act by Predicting the Future
Alexey Dosovitskiy, Vladlen Koltun
|
👻
Ghosted
|
cs.LG
|
289 |
9 years ago |
| 414 |
CausalGAN: Learning Causal Implicit Generative Models with Adversarial Training
Murat Kocaoglu, Christopher Snyder, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
288 |
8 years ago |
| 415 |
Episodic Curiosity through Reachability
Nikolay Savinov, Anton Raichuk, ... (+5 more)
|
🌅
Old Age
|
cs.LG
|
287 |
7 years ago |
| 416 |
Bag of Tricks for Adversarial Training
Tianyu Pang, Xiao Yang, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
285 |
5 years ago |
| 417 |
Explaining Image Classifiers by Counterfactual Generation
Chun-Hao Chang, Elliot Creager, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
285 |
7 years ago |
| 418 |
Neural Logic Machines
Honghua Dong, Jiayuan Mao, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
284 |
7 years ago |
| 419 |
Gradient descent aligns the layers of deep linear networks
Ziwei Ji, Matus Telgarsky
|
👻
Ghosted
|
cs.LG
|
283 |
7 years ago |
| 420 |
Deep Convolutional Networks as shallow Gaussian Processes
Adrià Garriga-Alonso, Carl Edward Rasmussen, Laurence Aitchison
|
👻
Ghosted
|
stat.ML
|
283 |
7 years ago |
| 421 |
A Bayesian Perspective on Generalization and Stochastic Gradient Descent
Samuel L. Smith, Quoc V. Le
|
👻
Ghosted
|
cs.LG
|
283 |
8 years ago |
| 422 |
Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
Amanpreet Singh, Tushar Jain, Sainbayar Sukhbaatar
|
👻
Ghosted
|
cs.LG
|
282 |
7 years ago |
| 423 |
Three Mechanisms of Weight Decay Regularization
Guodong Zhang, Chaoqi Wang, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
282 |
7 years ago |
| 424 |
SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data
Alon Brutzkus, Amir Globerson, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
282 |
8 years ago |
| 425 |
Adaptivity of deep ReLU network for learning in Besov and mixed smooth Besov spaces: optimal rate and curse of dimensionality
Taiji Suzuki
|
👻
Ghosted
|
stat.ML
|
281 |
7 years ago |
| 426 |
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning
Abhishek Gupta, Coline Devin, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
281 |
9 years ago |
| 427 |
Discrete Variational Autoencoders
Jason Tyler Rolfe
|
👻
Ghosted
|
stat.ML
|
280 |
9 years ago |
| 428 |
A Compare-Aggregate Model for Matching Text Sequences
Shuohang Wang, Jing Jiang
|
👻
Ghosted
|
cs.CL
|
280 |
9 years ago |
| 429 |
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
Tao Yu, Chien-Sheng Wu, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
279 |
5 years ago |
| 430 |
Representation Learning via Invariant Causal Mechanisms
Jovana Mitrovic, Brian McWilliams, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
278 |
5 years ago |
| 431 |
Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy
Danica J. Sutherland, Hsiao-Yu Tung, ... (+5 more)
|
👻
Ghosted
|
stat.ML
|
278 |
9 years ago |
| 432 |
VeRA: Vector-based Random Matrix Adaptation
Dawid J. Kopiczko, Tijmen Blankevoort, Yuki M. Asano
|
👻
Ghosted
|
cs.CL
|
278 |
2 years ago |
| 433 |
What Can Neural Networks Reason About?
Keyulu Xu, Jingling Li, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
276 |
6 years ago |
| 434 |
WRPN: Wide Reduced-Precision Networks
Asit Mishra, Eriko Nurvitadhi, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
276 |
8 years ago |
| 435 |
Natural Language Inference over Interaction Space
Yichen Gong, Heng Luo, Jian Zhang
|
👻
Ghosted
|
cs.CL
|
276 |
8 years ago |
| 436 |
Neural Map: Structured Memory for Deep Reinforcement Learning
Emilio Parisotto, Ruslan Salakhutdinov
|
👻
Ghosted
|
cs.LG
|
274 |
9 years ago |
| 437 |
Learning Visual Predictive Models of Physics for Playing Billiards
Katerina Fragkiadaki, Pulkit Agrawal, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
274 |
10 years ago |
| 438 |
Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking
Michael Sejr Schlichtkrull, Nicola De Cao, Ivan Titov
|
👻
Ghosted
|
cs.CL
|
272 |
5 years ago |
| 439 |
Online Learning Rate Adaptation with Hypergradient Descent
Atilim Gunes Baydin, Robert Cornish, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
272 |
9 years ago |
| 440 |
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao, Yanbin Zhao, ... (+3 more)
|
💀
404 Not Found
|
cs.CL
|
272 |
2 years ago |
| 441 |
Monotonic Chunkwise Attention
Chung-Cheng Chiu, Colin Raffel
|
👻
Ghosted
|
cs.CL
|
269 |
8 years ago |
| 442 |
Neural Programmer: Inducing Latent Programs with Gradient Descent
Arvind Neelakantan, Quoc V. Le, Ilya Sutskever
|
👻
Ghosted
|
cs.LG
|
268 |
10 years ago |
| 443 |
Inductive Matrix Completion Based on Graph Neural Networks
Muhan Zhang, Yixin Chen
|
👻
Ghosted
|
cs.IR
|
267 |
7 years ago |
| 444 |
Routing Networks: Adaptive Selection of Non-linear Functions for Multi-Task Learning
Clemens Rosenbaum, Tim Klinger, Matthew Riemer
|
👻
Ghosted
|
cs.LG
|
267 |
8 years ago |
| 445 |
Learning One-hidden-layer Neural Networks with Landscape Design
Rong Ge, Jason D. Lee, Tengyu Ma
|
👻
Ghosted
|
cs.LG
|
267 |
8 years ago |
| 446 |
Stochastic Controlled Averaging for Federated Learning with Communication Compression
Xinmeng Huang, Ping Li, Xiaoyun Li
|
👻
Ghosted
|
math.OC
|
266 |
2 years ago |
| 447 |
Deep Multi-task Representation Learning: A Tensor Factorisation Approach
Yongxin Yang, Timothy Hospedales
|
👻
Ghosted
|
cs.LG
|
266 |
9 years ago |
| 448 |
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts
Yoonho Lee, Annie S. Chen, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
265 |
3 years ago |
| 449 |
An Exponential Learning Rate Schedule for Deep Learning
Zhiyuan Li, Sanjeev Arora
|
👻
Ghosted
|
cs.LG
|
263 |
6 years ago |
| 450 |
Multilingual Neural Machine Translation with Knowledge Distillation
Xu Tan, Yi Ren, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
262 |
7 years ago |