| 1 |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin, Ming-Wei Chang, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
110.2K |
7 years ago |
| 2 |
Attention is not Explanation
Sarthak Jain, Byron C. Wallace
|
🌅
Old Age
|
cs.CL
|
1.6K |
7 years ago |
| 3 |
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
Alessandro Sordoni, Michel Galley, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
913 |
10 years ago |
| 4 |
A Novel Embedding Model for Knowledge Base Completion Based on Convolutional Neural Network
Dai Quoc Nguyen, Tu Dinh Nguyen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
801 |
8 years ago |
| 5 |
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu, Matt Gardner, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
797 |
7 years ago |
| 6 |
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini, Saadia Gabriel, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
774 |
7 years ago |
| 7 |
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer, John Wieting, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
768 |
8 years ago |
| 8 |
BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis
Hu Xu, Bing Liu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
763 |
7 years ago |
| 9 |
Visualizing and Understanding Neural Models in NLP
Jiwei Li, Xinlei Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
736 |
11 years ago |
| 10 |
Unsupervised Learning of Sentence Embeddings using Compositional n-Gram Features
Matteo Pagliardini, Prakhar Gupta, Martin Jaggi
|
👻
Ghosted
|
cs.CL
|
724 |
9 years ago |
| 11 |
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy Lin, Rodrigo Nogueira, Andrew Yates
|
👻
Ghosted
|
cs.IR
|
709 |
5 years ago |
| 12 |
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
Yingqi Qu, Yuchen Ding, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
696 |
5 years ago |
| 13 |
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence
Chi Sun, Luyao Huang, Xipeng Qiu
|
👻
Ghosted
|
cs.CL
|
694 |
7 years ago |
| 14 |
Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations
Sosuke Kobayashi
|
👻
Ghosted
|
cs.CL
|
663 |
8 years ago |
| 15 |
Explainable Prediction of Medical Codes from Clinical Text
James Mullenbach, Sarah Wiegreffe, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
636 |
8 years ago |
| 16 |
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat, Kyunghyun Cho, Yoshua Bengio
|
👻
Ghosted
|
cs.CL
|
634 |
10 years ago |
| 17 |
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
Hila Gonen, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
605 |
7 years ago |
| 18 |
Learning Distributed Representations of Sentences from Unlabelled Data
Felix Hill, Kyunghyun Cho, Anna Korhonen
|
👻
Ghosted
|
cs.CL
|
589 |
10 years ago |
| 19 |
Learning to Compose Neural Networks for Question Answering
Jacob Andreas, Marcus Rohrbach, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
580 |
10 years ago |
| 20 |
Ranking Sentences for Extractive Summarization with Reinforcement Learning
Shashi Narayan, Shay B. Cohen, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
579 |
8 years ago |
| 21 |
Recurrent Neural Network Grammars
Chris Dyer, Adhiguna Kuncoro, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
540 |
10 years ago |
| 22 |
Colorless green recurrent networks dream hierarchically
Kristina Gulordava, Piotr Bojanowski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
533 |
8 years ago |
| 23 |
Visual Storytelling
Ting-Hao, Huang, ... (+14 more)
|
👻
Ghosted
|
cs.CL
|
532 |
10 years ago |
| 24 |
Massively Multilingual Neural Machine Translation
Roee Aharoni, Melvin Johnson, Orhan Firat
|
👻
Ghosted
|
cs.CL
|
526 |
7 years ago |
| 25 |
End-to-End Open-Domain Question Answering with BERTserini
Wei Yang, Yuqing Xie, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
517 |
7 years ago |
| 26 |
Counter-fitting Word Vectors to Linguistic Constraints
Nikola Mrkšić, Diarmuid Ó Séaghdha, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
505 |
10 years ago |
| 27 |
Higher-order Coreference Resolution with Coarse-to-fine Inference
Kenton Lee, Luheng He, Luke Zettlemoyer
|
👻
Ghosted
|
cs.CL
|
490 |
8 years ago |
| 28 |
Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks
Ji Young Lee, Franck Dernoncourt
|
👻
Ghosted
|
cs.CL
|
465 |
10 years ago |
| 29 |
Gender Bias in Contextualized Word Embeddings
Jieyu Zhao, Tianlu Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
457 |
7 years ago |
| 30 |
Learning Natural Language Inference with LSTM
Shuohang Wang, Jing Jiang
|
👻
Ghosted
|
cs.CL
|
456 |
10 years ago |
| 31 |
Construction of the Literature Graph in Semantic Scholar
Waleed Ammar, Dirk Groeneveld, ... (+21 more)
|
👻
Ghosted
|
cs.CL
|
427 |
8 years ago |
| 32 |
Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing
Hao Fu, Chunyuan Li, ... (+4 more)
|
🌅
Old Age
|
cs.LG
|
417 |
7 years ago |
| 33 |
Competence-based Curriculum Learning for Neural Machine Translation
Emmanouil Antonios Platanios, Otilia Stretcu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
393 |
7 years ago |
| 34 |
Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting
Zhen Qin, Rolf Jagerman, ... (+10 more)
|
👻
Ghosted
|
cs.IR
|
378 |
2 years ago |
| 35 |
Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout
Hao Tan, Licheng Yu, Mohit Bansal
|
👻
Ghosted
|
cs.CL
|
374 |
7 years ago |
| 36 |
When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?
Ye Qi, Devendra Singh Sachan, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
368 |
8 years ago |
| 37 |
A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios
Michael A. Hedderich, Lukas Lange, ... (+3 more)
|
📚
The Cartographer
|
cs.CL
|
358 |
5 years ago |
| 38 |
Identifying and Reducing Gender Bias in Word-Level Language Models
Shikha Bordia, Samuel R. Bowman
|
👻
Ghosted
|
cs.CL
|
357 |
7 years ago |
| 39 |
Text Generation from Knowledge Graphs with Graph Transformers
Rik Koncel-Kedziorski, Dhanush Bekal, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
348 |
7 years ago |
| 40 |
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
Matt Post, David Vilar
|
👻
Ghosted
|
cs.CL
|
343 |
8 years ago |
| 41 |
A General Framework for Information Extraction using Dynamic Span Graphs
Yi Luan, Dave Wadden, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
335 |
7 years ago |
| 42 |
Multi-Source Neural Translation
Barret Zoph, Kevin Knight
|
👻
Ghosted
|
cs.CL
|
330 |
10 years ago |
| 43 |
Deep Communicating Agents for Abstractive Summarization
Asli Celikyilmaz, Antoine Bosselut, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
315 |
8 years ago |
| 44 |
Toward Abstractive Summarization Using Semantic Representations
Fei Liu, Jeffrey Flanigan, ... (+3 more)
|
💀
404 Not Found
|
cs.CL
|
312 |
8 years ago |
| 45 |
Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog
Sebastian Schuster, Sonal Gupta, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
308 |
7 years ago |
| 46 |
KBGAN: Adversarial Learning for Knowledge Graph Embeddings
Liwei Cai, William Yang Wang
|
👻
Ghosted
|
cs.CL
|
301 |
8 years ago |
| 47 |
TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation
Tao Yu, Zifan Li, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
294 |
8 years ago |
| 48 |
Universal Neural Machine Translation for Extremely Low Resource Languages
Jiatao Gu, Hany Hassan, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
293 |
8 years ago |
| 49 |
What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment
Hongyuan Mei, Mohit Bansal, Matthew R. Walter
|
👻
Ghosted
|
cs.CL
|
292 |
10 years ago |
| 50 |
Bidirectional Recurrent Neural Networks for Medical Event Detection in Electronic Health Records
Abhyuday Jagannatha, Hong Yu
|
👻
Ghosted
|
cs.CL
|
291 |
9 years ago |