| 1 |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin, Ming-Wei Chang, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
110.2K |
7 years ago |
| 2 |
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, Ari Holtzman, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
3.7K |
7 years ago |
| 3 |
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich, Barry Haddow, Alexandra Birch
|
👻
Ghosted
|
cs.CL
|
2.9K |
10 years ago |
| 4 |
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson, Mike Schuster, ... (+10 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
9 years ago |
| 5 |
Named Entity Recognition with Bidirectional LSTM-CNNs
Jason P. C. Chiu, Eric Nichols
|
👻
Ghosted
|
cs.CL
|
2.0K |
10 years ago |
| 6 |
OpenNMT: Open-source Toolkit for Neural Machine Translation
Guillaume Klein, Yoon Kim, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
8 years ago |
| 7 |
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai, Shaojie Bai, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
7 years ago |
| 8 |
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney, Dipanjan Das, Ellie Pavlick
|
👻
Ghosted
|
cs.CL
|
1.7K |
7 years ago |
| 9 |
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun, Nathan Scales, ... (+9 more)
|
💤
Eternal Rest
|
cs.CL
|
1.7K |
3 years ago |
| 10 |
Incorporating Copying Mechanism in Sequence-to-Sequence Learning
Jiatao Gu, Zhengdong Lu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.6K |
10 years ago |
| 11 |
Attention is not Explanation
Sarthak Jain, Byron C. Wallace
|
🌅
Old Age
|
cs.CL
|
1.6K |
7 years ago |
| 12 |
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita, David Talbot, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
7 years ago |
| 13 |
CoQA: A Conversational Question Answering Challenge
Siva Reddy, Danqi Chen, Christopher D. Manning
|
🌅
Old Age
|
cs.CL
|
1.3K |
7 years ago |
| 14 |
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa, Mohit Bansal
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 15 |
Enhanced LSTM for Natural Language Inference
Qian Chen, Xiaodan Zhu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
9 years ago |
| 16 |
Neural Responding Machine for Short-Text Conversation
Lifeng Shang, Zhengdong Lu, Hang Li
|
👻
Ghosted
|
cs.CL
|
1.2K |
11 years ago |
| 17 |
Latent Retrieval for Weakly Supervised Open Domain Question Answering
Kenton Lee, Ming-Wei Chang, Kristina Toutanova
|
👻
Ghosted
|
cs.CL
|
1.1K |
7 years ago |
| 18 |
A Persona-Based Neural Conversation Model
Jiwei Li, Michel Galley, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
10 years ago |
| 19 |
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change
William L. Hamilton, Jure Leskovec, Dan Jurafsky
|
👻
Ghosted
|
cs.CL
|
977 |
10 years ago |
| 20 |
Effective LSTMs for Target-Dependent Sentiment Classification
Duyu Tang, Bing Qin, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
970 |
10 years ago |
| 21 |
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
Tal Linzen, Emmanuel Dupoux, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
965 |
9 years ago |
| 22 |
ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs
Wenpeng Yin, Hinrich Schütze, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
965 |
10 years ago |
| 23 |
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau, German Kruszewski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
960 |
8 years ago |
| 24 |
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
Alessandro Sordoni, Michel Galley, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
913 |
10 years ago |
| 25 |
In-Context Retrieval-Augmented Language Models
Ori Ram, Yoav Levine, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
890 |
3 years ago |
| 26 |
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang, Hung-yi Lee
|
👻
Ghosted
|
cs.CL
|
885 |
3 years ago |
| 27 |
Automatic Detection of Fake News
Verónica Pérez-Rosas, Bennett Kleinberg, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
884 |
8 years ago |
| 28 |
Efficient Low-rank Multimodal Fusion with Modality-Specific Factors
Zhun Liu, Ying Shen, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
856 |
8 years ago |
| 29 |
Matching the Blanks: Distributional Similarity for Relation Learning
Livio Baldini Soares, Nicholas FitzGerald, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
839 |
7 years ago |
| 30 |
Neural Summarization by Extracting Sentences and Words
Jianpeng Cheng, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
836 |
10 years ago |
| 31 |
Large Language Models are not Fair Evaluators
Peiyi Wang, Lei Li, ... (+8 more)
|
💀
404 Not Found
|
cs.CL
|
834 |
3 years ago |
| 32 |
Towards Reasoning in Large Language Models: A Survey
Jie Huang, Kevin Chen-Chuan Chang
|
📚
The Cartographer
|
cs.CL
|
833 |
3 years ago |
| 33 |
Transition-Based Dependency Parsing with Stack Long Short-Term Memory
Chris Dyer, Miguel Ballesteros, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
815 |
11 years ago |
| 34 |
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
Harsh Trivedi, Niranjan Balasubramanian, ... (+2 more)
|
💀
404 Not Found
|
cs.CL
|
815 |
3 years ago |
| 35 |
A Novel Embedding Model for Knowledge Base Completion Based on Convolutional Neural Network
Dai Quoc Nguyen, Tu Dinh Nguyen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
801 |
8 years ago |
| 36 |
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu, Matt Gardner, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
797 |
7 years ago |
| 37 |
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini, Saadia Gabriel, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
774 |
7 years ago |
| 38 |
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders
Tiancheng Zhao, Ran Zhao, Maxine Eskenazi
|
👻
Ghosted
|
cs.CL
|
774 |
9 years ago |
| 39 |
Is Attention Interpretable?
Sofia Serrano, Noah A. Smith
|
👻
Ghosted
|
cs.CL
|
769 |
6 years ago |
| 40 |
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer, John Wieting, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
768 |
8 years ago |
| 41 |
BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis
Hu Xu, Bing Liu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
763 |
7 years ago |
| 42 |
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu, Zhengdong Lu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
763 |
10 years ago |
| 43 |
Language to Logical Form with Neural Attention
Li Dong, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
761 |
10 years ago |
| 44 |
Chinese NER Using Lattice LSTM
Yue Zhang, Jie Yang
|
👻
Ghosted
|
cs.CL
|
750 |
8 years ago |
| 45 |
Learning Deep Transformer Models for Machine Translation
Qiang Wang, Bei Li, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
748 |
7 years ago |
| 46 |
ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
736 |
6 years ago |
| 47 |
Visualizing and Understanding Neural Models in NLP
Jiwei Li, Xinlei Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
736 |
11 years ago |
| 48 |
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin, Graham Neubig, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
725 |
6 years ago |
| 49 |
Unsupervised Learning of Sentence Embeddings using Compositional n-Gram Features
Matteo Pagliardini, Prakhar Gupta, Martin Jaggi
|
👻
Ghosted
|
cs.CL
|
724 |
9 years ago |
| 50 |
Neural Approaches to Conversational AI
Jianfeng Gao, Michel Galley, Lihong Li
|
👻
Ghosted
|
cs.CL
|
723 |
7 years ago |