| 1 | Effective Approaches to Attention-based Neural Machine Translation<br>Minh-Thang Luong, Hieu Pham, Christopher D. Manning | 🏛️ Transcended | cs.CL | 8.3K | 10 years ago |
| 2 | A large annotated corpus for learning natural language inference<br>Samuel R. Bowman, Gabor Angeli, ... (+2 more) | 🌅 Old Age | cs.CL | 4.6K | 10 years ago |
| 3 | SciBERT: A Pretrained Language Model for Scientific Text<br>Iz Beltagy, Kyle Lo, Arman Cohan | 🌅 Old Age | cs.CL | 3.5K | 7 years ago |
| 4 | LXMERT: Learning Cross-Modality Encoder Representations from Transformers<br>Hao Tan, Mohit Bansal | 🌅 Old Age | cs.CL | 2.8K | 6 years ago |
| 5 | A Neural Attention Model for Abstractive Sentence Summarization<br>Alexander M. Rush, Sumit Chopra, Jason Weston | 👻 Ghosted | cs.CL | 2.8K | 10 years ago |
| 6 | Supervised Learning of Universal Sentence Representations from Natural Language Inference Data<br>Alexis Conneau, Douwe Kiela, ... (+3 more) | 👻 Ghosted | cs.CL | 2.2K | 9 years ago |
| 7 | Adversarial Examples for Evaluating Reading Comprehension Systems<br>Robin Jia, Percy Liang | 👻 Ghosted | cs.CL | 1.7K | 8 years ago |
| 8 | Tensor Fusion Network for Multimodal Sentiment Analysis<br>Amir Zadeh, Minghai Chen, ... (+3 more) | 👻 Ghosted | cs.CL | 1.6K | 8 years ago |
| 9 | Text Summarization with Pretrained Encoders<br>Yang Liu, Mirella Lapata | 🌅 Old Age | cs.CL | 1.6K | 6 years ago |
| 10 | RACE: Large-scale ReAding Comprehension Dataset From Examinations<br>Guokun Lai, Qizhe Xie, ... (+3 more) | 💀 404 Not Found | cs.CL | 1.6K | 9 years ago |
| 11 | Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding<br>Akira Fukui, Dong Huk Park, ... (+4 more) | 👻 Ghosted | cs.CV | 1.6K | 9 years ago |
| 12 | A Decomposable Attention Model for Natural Language Inference<br>Ankur P. Parikh, Oscar Täckström, ... (+2 more) | 👻 Ghosted | cs.CL | 1.4K | 9 years ago |
| 13 | Deep Reinforcement Learning for Dialogue Generation<br>Jiwei Li, Will Monroe, ... (+4 more) | 👻 Ghosted | cs.CL | 1.4K | 10 years ago |
| 14 | How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation<br>Chia-Wei Liu, Ryan Lowe, ... (+4 more) | 👻 Ghosted | cs.CL | 1.4K | 10 years ago |
| 15 | Video-LLaVA: Learning United Visual Representation by Alignment Before Projection<br>Bin Lin, Yang Ye, ... (+5 more) | 💀 404 Not Found | cs.CV | 1.3K | 2 years ago |
| 16 | Sequence-Level Knowledge Distillation<br>Yoon Kim, Alexander M. Rush | 👻 Ghosted | cs.CL | 1.2K | 9 years ago |
| 17 | Transformer Feed-Forward Layers Are Key-Value Memories<br>Mor Geva, Roei Schuster, ... (+2 more) | 👻 Ghosted | cs.CL | 1.2K | 5 years ago |
| 18 | Long Short-Term Memory-Networks for Machine Reading<br>Jianpeng Cheng, Li Dong, Mirella Lapata | 👻 Ghosted | cs.CL | 1.2K | 10 years ago |
| 19 | How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings<br>Kawin Ethayarajh | 👻 Ghosted | cs.CL | 1.1K | 6 years ago |
| 20 | Attention is not not Explanation<br>Sarah Wiegreffe, Yuval Pinter | 👻 Ghosted | cs.CL | 1.1K | 6 years ago |
| 21 | Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints<br>Jieyu Zhao, Tianlu Wang, ... (+3 more) | 👻 Ghosted | cs.AI | 1.0K | 8 years ago |
| 22 | Universal Adversarial Triggers for Attacking and Analyzing NLP<br>Eric Wallace, Shi Feng, ... (+3 more) | 👻 Ghosted | cs.CL | 1.0K | 6 years ago |
| 23 | Generating Natural Language Adversarial Examples<br>Moustafa Alzantot, Yash Sharma, ... (+4 more) | 👻 Ghosted | cs.CL | 1.0K | 8 years ago |
| 24 | Aspect Level Sentiment Classification with Deep Memory Network<br>Duyu Tang, Bing Qin, Ting Liu | 👻 Ghosted | cs.CL | 967 | 10 years ago |
| 25 | Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems<br>Tsung-Hsien Wen, Milica Gasic, ... (+4 more) | 👻 Ghosted | cs.CL | 967 | 10 years ago |
| 26 | End-to-end Neural Coreference Resolution<br>Kenton Lee, Luheng He, ... (+2 more) | 👻 Ghosted | cs.CL | 941 | 8 years ago |
| 27 | Adversarial Learning for Neural Dialogue Generation<br>Jiwei Li, Will Monroe, ... (+4 more) | 👻 Ghosted | cs.CL | 914 | 9 years ago |
| 28 | Transfer Learning for Low-Resource Neural Machine Translation<br>Barret Zoph, Deniz Yuret, ... (+2 more) | 👻 Ghosted | cs.CL | 894 | 10 years ago |
| 29 | Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling<br>Diego Marcheggiani, Ivan Titov | 👻 Ghosted | cs.CL | 869 | 9 years ago |
| 30 | Rationalizing Neural Predictions<br>Tao Lei, Regina Barzilay, Tommi Jaakkola | 👻 Ghosted | cs.CL | 857 | 9 years ago |
| 31 | Sparse Communication for Distributed Gradient Descent<br>Alham Fikri Aji, Kenneth Heafield | 👻 Ghosted | cs.CL | 830 | 9 years ago |
| 32 | DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning<br>Wenhan Xiong, Thien Hoang, William Yang Wang | 👻 Ghosted | cs.CL | 813 | 8 years ago |
| 33 | Large Language Models Can Self-Improve<br>Jiaxin Huang, Shixiang Shane Gu, ... (+5 more) | 👻 Ghosted | cs.CL | 784 | 3 years ago |
| 34 | Graph Convolution over Pruned Dependency Trees Improves Relation Extraction<br>Yuhao Zhang, Peng Qi, Christopher D. Manning | 🌅 Old Age | cs.CL | 782 | 7 years ago |
| 35 | Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction<br>Yi Luan, Luheng He, ... (+2 more) | 👻 Ghosted | cs.CL | 756 | 7 years ago |
| 36 | TVQA: Localized, Compositional Video Question Answering<br>Jie Lei, Licheng Yu, ... (+2 more) | 👻 Ghosted | cs.CL | 729 | 7 years ago |
| 37 | Knowledge Enhanced Contextual Word Representations<br>Matthew E. Peters, Mark Neumann, ... (+5 more) | 👻 Ghosted | cs.CL | 715 | 6 years ago |
| 38 | Bottom-Up Abstractive Summarization<br>Sebastian Gehrmann, Yuntian Deng, Alexander M. Rush | 👻 Ghosted | cs.CL | 714 | 7 years ago |
| 39 | Phrase-Based & Neural Unsupervised Machine Translation<br>Guillaume Lample, Myle Ott, ... (+3 more) | 👻 Ghosted | cs.CL | 702 | 8 years ago |
| 40 | Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path<br>Xu Yan, Lili Mou, ... (+4 more) | 👻 Ghosted | cs.CL | 682 | 10 years ago |
| 41 | MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance<br>Wei Zhao, Maxime Peyrard, ... (+4 more) | 👻 Ghosted | cs.CL | 680 | 6 years ago |
| 42 | Modeling Relation Paths for Representation Learning of Knowledge Bases<br>Yankai Lin, Zhiyuan Liu, ... (+4 more) | 👻 Ghosted | cs.CL | 648 | 11 years ago |
| 43 | Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation<br>Wang Ling, Tiago Luís, ... (+6 more) | 👻 Ghosted | cs.CL | 646 | 10 years ago |
| 44 | Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach<br>Wenpeng Yin, Jamaal Hay, Dan Roth | 🌅 Old Age | cs.CL | 637 | 6 years ago |
| 45 | DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation<br>Deepanway Ghosal, Navonil Majumder, ... (+3 more) | 👻 Ghosted | cs.CL | 617 | 6 years ago |
| 46 | Revealing the Dark Secrets of BERT<br>Olga Kovaleva, Alexey Romanov, ... (+2 more) | 👻 Ghosted | cs.CL | 612 | 6 years ago |
| 47 | Sequence-to-Sequence Learning as Beam-Search Optimization<br>Sam Wiseman, Alexander M. Rush | 👻 Ghosted | cs.CL | 612 | 9 years ago |
| 48 | Designing and Interpreting Probes with Control Tasks<br>John Hewitt, Percy Liang | 👻 Ghosted | cs.CL | 608 | 6 years ago |
| 49 | A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks<br>Kazuma Hashimoto, Caiming Xiong, ... (+2 more) | 👻 Ghosted | cs.CL | 585 | 9 years ago |
| 50 | Solving General Arithmetic Word Problems<br>Subhro Roy, Dan Roth | 👻 Ghosted | cs.CL | 583 | 9 years ago |