A Survey of Methods to Leverage Monolingual Data in Low-resource Neural Machine Translation
October 01, 2019 ยท The Cartographer ยท ๐ arXiv.org
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Methods to Leverage Monolingual Data in Low-resource Neural Machine Translation"
Evidence collected by the PWNC Scanner
Authors
Ilshat Gibadullin, Aidar Valeev, Albina Khusainova, Adil Khan
arXiv ID
1910.00373
Category
cs.CL: Computation & Language
Citations
14
Venue
arXiv.org
Last Checked
2 days ago
Abstract
Neural machine translation has become the state-of-the-art for language pairs with large parallel corpora. However, the quality of machine translation for low-resource languages leaves much to be desired. There are several approaches to mitigate this problem, such as transfer learning, semi-supervised and unsupervised learning techniques. In this paper, we review the existing methods, where the main idea is to exploit the power of monolingual data, which, compared to parallel, is usually easier to obtain and significantly greater in amount.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age