| 1 |
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush, Sumit Chopra, Jason Weston
|
👻
Ghosted
|
cs.CL
|
2.8K |
10 years ago |
| 2 |
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
Alexis Conneau, Douwe Kiela, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
9 years ago |
| 3 |
Adversarial Examples for Evaluating Reading Comprehension Systems
Robin Jia, Percy Liang
|
👻
Ghosted
|
cs.CL
|
1.7K |
8 years ago |
| 4 |
Tensor Fusion Network for Multimodal Sentiment Analysis
Amir Zadeh, Minghai Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.6K |
8 years ago |
| 5 |
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui, Dong Huk Park, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
1.6K |
10 years ago |
| 6 |
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh, Oscar Täckström, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
10 years ago |
| 7 |
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li, Will Monroe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
10 years ago |
| 8 |
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
Chia-Wei Liu, Ryan Lowe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
10 years ago |
| 9 |
Sequence-Level Knowledge Distillation
Yoon Kim, Alexander M. Rush
|
👻
Ghosted
|
cs.CL
|
1.2K |
9 years ago |
| 10 |
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva, Roei Schuster, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
5 years ago |
| 11 |
Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng, Li Dong, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 12 |
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings
Kawin Ethayarajh
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 13 |
Attention is not not Explanation
Sarah Wiegreffe, Yuval Pinter
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 14 |
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Jieyu Zhao, Tianlu Wang, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
1.0K |
8 years ago |
| 15 |
Universal Adversarial Triggers for Attacking and Analyzing NLP
Eric Wallace, Shi Feng, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
6 years ago |
| 16 |
Generating Natural Language Adversarial Examples
Moustafa Alzantot, Yash Sharma, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
8 years ago |
| 17 |
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems
Tsung-Hsien Wen, Milica Gasic, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
967 |
10 years ago |
| 18 |
Aspect Level Sentiment Classification with Deep Memory Network
Duyu Tang, Bing Qin, Ting Liu
|
👻
Ghosted
|
cs.CL
|
967 |
10 years ago |
| 19 |
End-to-end Neural Coreference Resolution
Kenton Lee, Luheng He, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
941 |
8 years ago |
| 20 |
Adversarial Learning for Neural Dialogue Generation
Jiwei Li, Will Monroe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
914 |
9 years ago |
| 21 |
Transfer Learning for Low-Resource Neural Machine Translation
Barret Zoph, Deniz Yuret, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
894 |
10 years ago |
| 22 |
Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling
Diego Marcheggiani, Ivan Titov
|
👻
Ghosted
|
cs.CL
|
869 |
9 years ago |
| 23 |
Rationalizing Neural Predictions
Tao Lei, Regina Barzilay, Tommi Jaakkola
|
👻
Ghosted
|
cs.CL
|
857 |
9 years ago |
| 24 |
Sparse Communication for Distributed Gradient Descent
Alham Fikri Aji, Kenneth Heafield
|
👻
Ghosted
|
cs.CL
|
830 |
9 years ago |
| 25 |
DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning
Wenhan Xiong, Thien Hoang, William Yang Wang
|
👻
Ghosted
|
cs.CL
|
813 |
8 years ago |
| 26 |
Large Language Models Can Self-Improve
Jiaxin Huang, Shixiang Shane Gu, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
784 |
3 years ago |
| 27 |
Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction
Yi Luan, Luheng He, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
756 |
7 years ago |
| 28 |
TVQA: Localized, Compositional Video Question Answering
Jie Lei, Licheng Yu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
729 |
7 years ago |
| 29 |
Knowledge Enhanced Contextual Word Representations
Matthew E. Peters, Mark Neumann, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
715 |
6 years ago |
| 30 |
Bottom-Up Abstractive Summarization
Sebastian Gehrmann, Yuntian Deng, Alexander M. Rush
|
👻
Ghosted
|
cs.CL
|
714 |
7 years ago |
| 31 |
Phrase-Based & Neural Unsupervised Machine Translation
Guillaume Lample, Myle Ott, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
702 |
8 years ago |
| 32 |
Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path
Xu Yan, Lili Mou, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
682 |
10 years ago |
| 33 |
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
Wei Zhao, Maxime Peyrard, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
680 |
6 years ago |
| 34 |
Modeling Relation Paths for Representation Learning of Knowledge Bases
Yankai Lin, Zhiyuan Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
648 |
11 years ago |
| 35 |
Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation
Wang Ling, Tiago Luís, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
646 |
10 years ago |
| 36 |
DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation
Deepanway Ghosal, Navonil Majumder, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
617 |
6 years ago |
| 37 |
Sequence-to-Sequence Learning as Beam-Search Optimization
Sam Wiseman, Alexander M. Rush
|
👻
Ghosted
|
cs.CL
|
612 |
9 years ago |
| 38 |
Revealing the Dark Secrets of BERT
Olga Kovaleva, Alexey Romanov, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
612 |
6 years ago |
| 39 |
Designing and Interpreting Probes with Control Tasks
John Hewitt, Percy Liang
|
👻
Ghosted
|
cs.CL
|
608 |
6 years ago |
| 40 |
A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks
Kazuma Hashimoto, Caiming Xiong, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
585 |
9 years ago |
| 41 |
Solving General Arithmetic Word Problems
Subhro Roy, Dan Roth
|
👻
Ghosted
|
cs.CL
|
583 |
9 years ago |
| 42 |
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Swabha Swayamdipta, Roy Schwartz, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
535 |
5 years ago |
| 43 |
KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning
Bill Yuchen Lin, Xinyue Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
514 |
6 years ago |
| 44 |
Graph Convolutional Encoders for Syntax-aware Neural Machine Translation
Jasmijn Bastings, Ivan Titov, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
509 |
9 years ago |
| 45 |
Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks
Chen Zhang, Qiuchi Li, Dawei Song
|
👻
Ghosted
|
cs.CL
|
506 |
6 years ago |
| 46 |
Don't Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases
Christopher Clark, Mark Yatskar, Luke Zettlemoyer
|
👻
Ghosted
|
cs.CL
|
505 |
6 years ago |
| 47 |
Learning Sequence Encoders for Temporal Knowledge Graph Completion
Alberto García-Durán, Sebastijan Dumančić, Mathias Niepert
|
👻
Ghosted
|
cs.AI
|
504 |
7 years ago |
| 48 |
Why We Need New Evaluation Metrics for NLG
Jekaterina Novikova, Ondřej Dušek, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
493 |
8 years ago |
| 49 |
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das, Harsh Agrawal, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
482 |
9 years ago |
| 50 |
Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement
Jason Lee, Elman Mansimov, Kyunghyun Cho
|
👻
Ghosted
|
cs.LG
|
481 |
8 years ago |