A Survey of Active Learning for Text Classification using Deep Neural Networks
August 17, 2020 Β· The Cartographer Β· π arXiv.org
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Active Learning for Text Classification using Deep Neural Networks"
Evidence collected by the PWNC Scanner
Authors
Christopher SchrΓΆder, Andreas Niekler
arXiv ID
2008.07267
Category
cs.CL: Computation & Language
Cross-listed
cs.LG
Citations
109
Venue
arXiv.org
Last Checked
1 day ago
Abstract
Natural language processing (NLP) and neural networks (NNs) have both undergone significant changes in recent years. For active learning (AL) purposes, NNs are, however, less commonly used -- despite their current popularity. By using the superior text classification performance of NNs for AL, we can either increase a model's performance using the same amount of data or reduce the data and therefore the required annotation efforts while keeping the same performance. We review AL for text classification using deep neural networks (DNNs) and elaborate on two main causes which used to hinder the adoption: (a) the inability of NNs to provide reliable uncertainty estimates, on which the most commonly used query strategies rely, and (b) the challenge of training DNNs on small data. To investigate the former, we construct a taxonomy of query strategies, which distinguishes between data-based, model-based, and prediction-based instance selection, and investigate the prevalence of these classes in recent research. Moreover, we review recent NN-based advances in NLP like word embeddings or language models in the context of (D)NNs, survey the current state-of-the-art at the intersection of AL, text classification, and DNNs and relate recent advances in NLP to AL. Finally, we analyze recent work in AL for text classification, connect the respective query strategies to the taxonomy, and outline commonalities and shortcomings. As a result, we highlight gaps in current research and present open research questions.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Computation & Language
π
π
Old Age
π
π
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
π
π
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
ποΈ
ποΈ
Transcended
Effective Approaches to Attention-based Neural Machine Translation
π
π
Old Age
A large annotated corpus for learning natural language inference
π
π
Old Age