| 1 |
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, Ari Holtzman, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
3.7K |
7 years ago |
| 2 |
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich, Barry Haddow, Alexandra Birch
|
👻
Ghosted
|
cs.CL
|
2.9K |
10 years ago |
| 3 |
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson, Mike Schuster, ... (+10 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
9 years ago |
| 4 |
Named Entity Recognition with Bidirectional LSTM-CNNs
Jason P. C. Chiu, Eric Nichols
|
👻
Ghosted
|
cs.CL
|
2.0K |
10 years ago |
| 5 |
OpenNMT: Open-source Toolkit for Neural Machine Translation
Guillaume Klein, Yoon Kim, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
8 years ago |
| 6 |
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai, Shaojie Bai, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
7 years ago |
| 7 |
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney, Dipanjan Das, Ellie Pavlick
|
👻
Ghosted
|
cs.CL
|
1.7K |
7 years ago |
| 8 |
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun, Nathan Scales, ... (+9 more)
|
💤
Eternal Rest
|
cs.CL
|
1.7K |
3 years ago |
| 9 |
Incorporating Copying Mechanism in Sequence-to-Sequence Learning
Jiatao Gu, Zhengdong Lu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.6K |
10 years ago |
| 10 |
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita, David Talbot, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
7 years ago |
| 11 |
CoQA: A Conversational Question Answering Challenge
Siva Reddy, Danqi Chen, Christopher D. Manning
|
🌅
Old Age
|
cs.CL
|
1.3K |
7 years ago |
| 12 |
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa, Mohit Bansal
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 13 |
Enhanced LSTM for Natural Language Inference
Qian Chen, Xiaodan Zhu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
9 years ago |
| 14 |
Neural Responding Machine for Short-Text Conversation
Lifeng Shang, Zhengdong Lu, Hang Li
|
👻
Ghosted
|
cs.CL
|
1.2K |
11 years ago |
| 15 |
Latent Retrieval for Weakly Supervised Open Domain Question Answering
Kenton Lee, Ming-Wei Chang, Kristina Toutanova
|
👻
Ghosted
|
cs.CL
|
1.1K |
7 years ago |
| 16 |
A Persona-Based Neural Conversation Model
Jiwei Li, Michel Galley, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
10 years ago |
| 17 |
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change
William L. Hamilton, Jure Leskovec, Dan Jurafsky
|
👻
Ghosted
|
cs.CL
|
977 |
10 years ago |
| 18 |
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
Tal Linzen, Emmanuel Dupoux, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
965 |
9 years ago |
| 19 |
ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs
Wenpeng Yin, Hinrich Schütze, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
965 |
10 years ago |
| 20 |
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau, German Kruszewski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
960 |
8 years ago |
| 21 |
In-Context Retrieval-Augmented Language Models
Ori Ram, Yoav Levine, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
890 |
3 years ago |
| 22 |
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang, Hung-yi Lee
|
👻
Ghosted
|
cs.CL
|
885 |
3 years ago |
| 23 |
Efficient Low-rank Multimodal Fusion with Modality-Specific Factors
Zhun Liu, Ying Shen, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
856 |
8 years ago |
| 24 |
Matching the Blanks: Distributional Similarity for Relation Learning
Livio Baldini Soares, Nicholas FitzGerald, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
839 |
6 years ago |
| 25 |
Neural Summarization by Extracting Sentences and Words
Jianpeng Cheng, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
836 |
10 years ago |
| 26 |
Large Language Models are not Fair Evaluators
Peiyi Wang, Lei Li, ... (+8 more)
|
💀
404 Not Found
|
cs.CL
|
834 |
3 years ago |
| 27 |
Towards Reasoning in Large Language Models: A Survey
Jie Huang, Kevin Chen-Chuan Chang
|
📚
The Cartographer
|
cs.CL
|
833 |
3 years ago |
| 28 |
Transition-Based Dependency Parsing with Stack Long Short-Term Memory
Chris Dyer, Miguel Ballesteros, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
815 |
11 years ago |
| 29 |
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
Harsh Trivedi, Niranjan Balasubramanian, ... (+2 more)
|
💀
404 Not Found
|
cs.CL
|
815 |
3 years ago |
| 30 |
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders
Tiancheng Zhao, Ran Zhao, Maxine Eskenazi
|
👻
Ghosted
|
cs.CL
|
774 |
9 years ago |
| 31 |
Is Attention Interpretable?
Sofia Serrano, Noah A. Smith
|
👻
Ghosted
|
cs.CL
|
769 |
6 years ago |
| 32 |
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu, Zhengdong Lu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
763 |
10 years ago |
| 33 |
Language to Logical Form with Neural Attention
Li Dong, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
761 |
10 years ago |
| 34 |
Chinese NER Using Lattice LSTM
Yue Zhang, Jie Yang
|
👻
Ghosted
|
cs.CL
|
750 |
8 years ago |
| 35 |
Learning Deep Transformer Models for Machine Translation
Qiang Wang, Bei Li, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
748 |
7 years ago |
| 36 |
ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
736 |
6 years ago |
| 37 |
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin, Graham Neubig, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
725 |
6 years ago |
| 38 |
Neural Approaches to Conversational AI
Jianfeng Gao, Michel Galley, Lihong Li
|
👻
Ghosted
|
cs.CL
|
723 |
7 years ago |
| 39 |
Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme
Suncong Zheng, Feng Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
691 |
8 years ago |
| 40 |
A Unified MRC Framework for Named Entity Recognition
Xiaoya Li, Jingrong Feng, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
683 |
6 years ago |
| 41 |
Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations
Eliyahu Kiperwasser, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
681 |
10 years ago |
| 42 |
A Multiscale Visualization of Attention in the Transformer Model
Jesse Vig
|
👻
Ghosted
|
cs.HC
|
673 |
6 years ago |
| 43 |
A Stylometric Inquiry into Hyperpartisan and Fake News
Martin Potthast, Johannes Kiesel, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
672 |
9 years ago |
| 44 |
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters, Waleed Ammar, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
658 |
9 years ago |
| 45 |
Adversarial Multi-task Learning for Text Classification
Pengfei Liu, Xipeng Qiu, Xuanjing Huang
|
👻
Ghosted
|
cs.CL
|
655 |
9 years ago |
| 46 |
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models
Allyson Ettinger
|
👻
Ghosted
|
cs.CL
|
648 |
6 years ago |
| 47 |
Discovering Language Model Behaviors with Model-Written Evaluations
Ethan Perez, Sam Ringer, ... (+61 more)
|
💤
Eternal Rest
|
cs.CL
|
632 |
3 years ago |
| 48 |
Harnessing Deep Neural Networks with Logic Rules
Zhiting Hu, Xuezhe Ma, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
631 |
10 years ago |
| 49 |
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings
Mikel Artetxe, Gorka Labaka, Eneko Agirre
|
🌅
Old Age
|
cs.CL
|
613 |
8 years ago |
| 50 |
A Hierarchical Neural Autoencoder for Paragraphs and Documents
Jiwei Li, Minh-Thang Luong, Dan Jurafsky
|
👻
Ghosted
|
cs.CL
|
612 |
11 years ago |