A Survey of Methods to Leverage Monolingual Data in Low-resource Neural Machine Translation

October 01, 2019 · The Cartographer · 🏛 arXiv.org

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Methods to Leverage Monolingual Data in Low-resource Neural Machine Translation"

Evidence collected by the PWNC Scanner

Authors Ilshat Gibadullin, Aidar Valeev, Albina Khusainova, Adil Khan arXiv ID 1910.00373 Category cs.CL: Computation & Language Citations 14 Venue arXiv.org Last Checked 2 days ago

Abstract

Neural machine translation has become the state-of-the-art for language pairs with large parallel corpora. However, the quality of machine translation for low-resource languages leaves much to be desired. There are several approaches to mitigate this problem, such as transfer learning, semi-supervised and unsupervised learning techniques. In this paper, we review the existing methods, where the main idea is to exploit the power of monolingual data, which, compared to parallel, is usually easier to obtain and significantly greater in amount.