| 101 |
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Armen Aghajanyan, Luke Zettlemoyer, Sonal Gupta
|
👻
Ghosted
|
cs.LG
|
759 |
5 years ago |
| 102 |
ELI5: Long Form Question Answering
Angela Fan, Yacine Jernite, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
752 |
6 years ago |
| 103 |
Chinese NER Using Lattice LSTM
Yue Zhang, Jie Yang
|
👻
Ghosted
|
cs.CL
|
750 |
7 years ago |
| 104 |
Learning Deep Transformer Models for Machine Translation
Qiang Wang, Bei Li, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
748 |
6 years ago |
| 105 |
ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
736 |
6 years ago |
| 106 |
Visualizing and Understanding Neural Models in NLP
Jiwei Li, Xinlei Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
736 |
10 years ago |
| 107 |
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin, Graham Neubig, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
725 |
5 years ago |
| 108 |
Unsupervised Learning of Sentence Embeddings using Compositional n-Gram Features
Matteo Pagliardini, Prakhar Gupta, Martin Jaggi
|
👻
Ghosted
|
cs.CL
|
724 |
9 years ago |
| 109 |
Neural Approaches to Conversational AI
Jianfeng Gao, Michel Galley, Lihong Li
|
👻
Ghosted
|
cs.CL
|
723 |
7 years ago |
| 110 |
The Web as a Knowledge-base for Answering Complex Questions
Alon Talmor, Jonathan Berant
|
👻
Ghosted
|
cs.CL
|
718 |
8 years ago |
| 111 |
Learning to Ask: Neural Question Generation for Reading Comprehension
Xinya Du, Junru Shao, Claire Cardie
|
👻
Ghosted
|
cs.CL
|
712 |
8 years ago |
| 112 |
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy Lin, Rodrigo Nogueira, Andrew Yates
|
👻
Ghosted
|
cs.IR
|
709 |
5 years ago |
| 113 |
Gender Bias in Coreference Resolution
Rachel Rudinger, Jason Naradowsky, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
708 |
8 years ago |
| 114 |
GLTR: Statistical Detection and Visualization of Generated Text
Sebastian Gehrmann, Hendrik Strobelt, Alexander M. Rush
|
👻
Ghosted
|
cs.CL
|
707 |
6 years ago |
| 115 |
Dice Loss for Data-imbalanced NLP Tasks
Xiaoya Li, Xiaofei Sun, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
701 |
6 years ago |
| 116 |
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
Yingqi Qu, Yuchen Ding, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
696 |
5 years ago |
| 117 |
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence
Chi Sun, Luyao Huang, Xipeng Qiu
|
👻
Ghosted
|
cs.CL
|
694 |
7 years ago |
| 118 |
On Measuring Social Biases in Sentence Encoders
Chandler May, Alex Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
691 |
7 years ago |
| 119 |
Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme
Suncong Zheng, Feng Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
691 |
8 years ago |
| 120 |
A Unified MRC Framework for Named Entity Recognition
Xiaoya Li, Jingrong Feng, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
683 |
6 years ago |
| 121 |
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr, Stephanie Zhou, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
683 |
7 years ago |
| 122 |
Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations
Eliyahu Kiperwasser, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
681 |
10 years ago |
| 123 |
A Multiscale Visualization of Attention in the Transformer Model
Jesse Vig
|
👻
Ghosted
|
cs.HC
|
673 |
6 years ago |
| 124 |
Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model
Alexander R. Fabbri, Irene Li, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
672 |
6 years ago |
| 125 |
A Stylometric Inquiry into Hyperpartisan and Fake News
Martin Potthast, Johannes Kiesel, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
672 |
9 years ago |
| 126 |
Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations
Sosuke Kobayashi
|
👻
Ghosted
|
cs.CL
|
663 |
7 years ago |
| 127 |
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters, Waleed Ammar, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
658 |
8 years ago |
| 128 |
Adversarial Multi-task Learning for Text Classification
Pengfei Liu, Xipeng Qiu, Xuanjing Huang
|
👻
Ghosted
|
cs.CL
|
655 |
9 years ago |
| 129 |
BLiMP: The Benchmark of Linguistic Minimal Pairs for English
Alex Warstadt, Alicia Parrish, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
650 |
6 years ago |
| 130 |
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models
Allyson Ettinger
|
👻
Ghosted
|
cs.CL
|
648 |
6 years ago |
| 131 |
A Survey on Recent Advances in Named Entity Recognition from Deep Learning models
Vikas Yadav, Steven Bethard
|
👻
Ghosted
|
cs.CL
|
645 |
6 years ago |
| 132 |
Explainable Prediction of Medical Codes from Clinical Text
James Mullenbach, Sarah Wiegreffe, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
636 |
8 years ago |
| 133 |
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat, Kyunghyun Cho, Yoshua Bengio
|
👻
Ghosted
|
cs.CL
|
634 |
10 years ago |
| 134 |
Discovering Language Model Behaviors with Model-Written Evaluations
Ethan Perez, Sam Ringer, ... (+61 more)
|
💤
Eternal Rest
|
cs.CL
|
632 |
3 years ago |
| 135 |
Harnessing Deep Neural Networks with Logic Rules
Zhiting Hu, Xuezhe Ma, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
631 |
10 years ago |
| 136 |
Explain Yourself! Leveraging Language Models for Commonsense Reasoning
Nazneen Fatema Rajani, Bryan McCann, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
623 |
6 years ago |
| 137 |
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing, Pengtao Xie, Eric Xing
|
👻
Ghosted
|
cs.CL
|
622 |
8 years ago |
| 138 |
Mitigating Gender Bias in Natural Language Processing: Literature Review
Tony Sun, Andrew Gaut, ... (+8 more)
|
👻
Ghosted
|
cs.CL
|
619 |
6 years ago |
| 139 |
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings
Mikel Artetxe, Gorka Labaka, Eneko Agirre
|
🌅
Old Age
|
cs.CL
|
613 |
7 years ago |
| 140 |
A Hierarchical Neural Autoencoder for Paragraphs and Documents
Jiwei Li, Minh-Thang Luong, Dan Jurafsky
|
👻
Ghosted
|
cs.CL
|
612 |
10 years ago |
| 141 |
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
Yen-Chun Chen, Mohit Bansal
|
👻
Ghosted
|
cs.CL
|
606 |
7 years ago |
| 142 |
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
Hila Gonen, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
605 |
7 years ago |
| 143 |
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov, James Glass
|
👻
Ghosted
|
cs.CL
|
604 |
7 years ago |
| 144 |
Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies
Max Grusky, Mor Naaman, Yoav Artzi
|
👻
Ghosted
|
cs.CL
|
603 |
7 years ago |
| 145 |
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Haoming Jiang, Pengcheng He, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
593 |
6 years ago |
| 146 |
Aspect Based Sentiment Analysis with Gated Convolutional Networks
Wei Xue, Tao Li
|
👻
Ghosted
|
cs.CL
|
593 |
7 years ago |
| 147 |
Classifying Relations by Ranking with Convolutional Neural Networks
Cicero Nogueira dos Santos, Bing Xiang, Bowen Zhou
|
👻
Ghosted
|
cs.CL
|
591 |
11 years ago |
| 148 |
Learning Distributed Representations of Sentences from Unlabelled Data
Felix Hill, Kyunghyun Cho, Anna Korhonen
|
👻
Ghosted
|
cs.CL
|
589 |
10 years ago |
| 149 |
A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task
Danqi Chen, Jason Bolton, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
583 |
9 years ago |
| 150 |
PAWS: Paraphrase Adversaries from Word Scrambling
Yuan Zhang, Jason Baldridge, Luheng He
|
👻
Ghosted
|
cs.CL
|
580 |
7 years ago |