Unsupervised dialectal neural machine translation

  • Wael Farhan
  • Bashar Talafha
  • Analle Abuammar
  • Ruba Jaikat
  • Mahmoud Al-Ayyoub
  • Ahmad Bisher Tarakji
  • Anas Toma

  • Samsung R&D Institute Jordan
  • Jordan University of Science and Technology

Research output: Contribution to journal › Article › peer-review

42 Scopus citations

Abstract

In this paper, we present the first work on unsupervised dialectal Neural Machine Translation (NMT), where the source dialect is not represented in the parallel training corpus. Two systems are proposed for this problem. The first one is the Dialectal to Standard Language Translation (D2SLT) system, which is based on the standard attentional sequence-to-sequence model while introducing two novel ideas leveraging similarities among dialects: using common words as anchor points when learning word embeddings and a decoder scoring mechanism that depends on cosine similarity and language models. The second system is based on the celebrated Google NMT (GNMT) system. We first evaluate these systems in a supervised setting (where the training and testing are done using our parallel corpus of Jordanian dialect and Modern Standard Arabic (MSA)) before going into the unsupervised setting (where we train each system once on a Saudi-MSA parallel corpus and once on an Egyptian-MSA parallel corpus and test them on the Jordanian-MSA parallel corpus). The highest BLEU score obtained in the unsupervised setting is 32.14 (by D2SLT trained on Saudi-MSA data), which is remarkably high compared with the highest BLEU score obtained in the supervised setting, which is 48.25.
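The abstract mentions a decoder scoring mechanism that depends on cosine similarity and language models but does not specify it. The following is a minimal sketch of one way such a score could be combined, not the authors' implementation: the function names, the additive combination, the `lm_weight` parameter, the fallback probability, and the toy vectors are all assumptions made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def score_candidates(predicted, embeddings, lm_log_probs, lm_weight=0.1):
    """Score each candidate word against a predicted embedding vector.

    Assumed scoring rule (hypothetical, not taken from the paper):
        score = cosine(predicted, word_vector) + lm_weight * log P_lm(word)
    Words missing from the language model get a small fallback probability.
    """
    fallback = math.log(1e-6)
    return {
        word: cosine_similarity(predicted, vec)
        + lm_weight * lm_log_probs.get(word, fallback)
        for word, vec in embeddings.items()
    }


def best_candidate(predicted, embeddings, lm_log_probs, lm_weight=0.1):
    """Return the candidate word with the highest combined score."""
    scores = score_candidates(predicted, embeddings, lm_log_probs, lm_weight)
    return max(scores, key=scores.get)


# Toy data, invented purely for illustration.
toy_embeddings = {
    "book": [1.0, 0.0],
    "pen": [0.0, 1.0],
    "house": [0.7, 0.7],
}
toy_lm_log_probs = {
    "book": math.log(0.5),
    "pen": math.log(0.3),
    "house": math.log(0.2),
}
toy_predicted = [0.9, 0.1]

if __name__ == "__main__":
    print(best_candidate(toy_predicted, toy_embeddings, toy_lm_log_probs))
```

In this sketch the cosine term rewards candidates whose embeddings lie close to the vector the decoder produced, while the language-model term nudges the choice toward words that are more probable under the model; `lm_weight` controls the balance between the two.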

Original language: English
Article number: 102181
Journal: Information Processing and Management
Volume: 57
Issue number: 3
State: Published - May 2020
Externally published: Yes

Keywords

  • Neural machine translation
  • Regression-based decoding
  • Shared embedding
  • Unsupervised dialectal translation
