A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics and Benchmark Datasets
June 21, 2020 ยท The Cartographer ยท ๐ Applied Sciences
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics and Benchmark Datasets"
Evidence collected by the PWNC Scanner
Authors
Changchang Zeng, Shaobo Li, Qin Li, Jie Hu, Jianjun Hu
arXiv ID
2006.11880
Category
cs.CL: Computation & Language
Cross-listed
cs.AI,
cs.LG
Citations
116
Venue
Applied Sciences
Last Checked
1 day ago
Abstract
Machine Reading Comprehension (MRC) is a challenging Natural Language Processing(NLP) research field with wide real-world applications. The great progress of this field in recent years is mainly due to the emergence of large-scale datasets and deep learning. At present, a lot of MRC models have already surpassed human performance on various benchmark datasets despite the obvious giant gap between existing MRC models and genuine human-level reading comprehension. This shows the need for improving existing datasets, evaluation metrics, and models to move current MRC models toward "real" understanding. To address the current lack of comprehensive survey of existing MRC tasks, evaluation metrics, and datasets, herein, (1) we analyze 57 MRC tasks and datasets and propose a more precise classification method of MRC tasks with 4 different attributes; (2) we summarized 9 evaluation metrics of MRC tasks, 7 attributes and 10 characteristics of MRC datasets; (3) We also discuss key open issues in MRC research and highlighted future research directions. In addition, we have collected, organized, and published our data on the companion website(https://mrc-datasets.github.io/) where MRC researchers could directly access each MRC dataset, papers, baseline projects, and the leaderboard.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐๏ธ
๐๏ธ
Transcended
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age