A Benchmark of Rule-Based and Neural Coreference Resolution in Dutch Novels and News
November 03, 2020 ยท Entered Twilight ยท ๐ CRAC
"Last commit was 5.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .gitignore, .travis.yml, LICENSE, README.md, e2edutch, requirements.txt, scripts, setup.py, test
Authors
Corbรจn Poot, Andreas van Cranenburgh
arXiv ID
2011.01615
Category
cs.CL: Computation & Language
Citations
18
Venue
CRAC
Repository
https://github.com/andreasvc/crac2020
โญ 4
Last Checked
4 months ago
Abstract
We evaluate a rule-based (Lee et al., 2013) and neural (Lee et al., 2018) coreference system on Dutch datasets of two domains: literary novels and news/Wikipedia text. The results provide insight into the relative strengths of data-driven and knowledge-driven systems, as well as the influence of domain, document length, and annotation schemes. The neural system performs best on news/Wikipedia text, while the rule-based system performs best on literature. The neural system shows weaknesses with limited training data and long documents, while the rule-based system is affected by annotation differences. The code and models used in this paper are available at https://github.com/andreasvc/crac2020
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age