stat946w18/Unsupervised Machine Translation Using Monolingual Corpora Only

From statwiki
Revision as of 19:40, 17 February 2018 by Pa2forsy (talk | contribs) (Introduction)

Neural machine translation systems are typically trained on large parallel corpora, that is, collections of sentence pairs in which each sentence is paired with its translation. This paper proposes an unsupervised neural machine translation system that can be trained without any such parallel data, using only monolingual corpora.

Overview of Translation System

The unsupervised translation system has four components:

  1. An unsupervised word-vector alignment system
  2. An encoder
  3. A decoder
  4. A discriminator
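
To make the roles of the components listed above more concrete, here is a toy sketch of how they might fit together. It assumes (this summary does not say so explicitly) that a single encoder and decoder serve both languages through a shared latent space, and that the discriminator tries to guess which language a latent encoding came from. The lookup table `SHARED` and the functions `encode`, `decode`, `translate`, and `discriminate` are hypothetical stand-ins for illustration; the real components are neural networks.

```python
# Toy stand-in for aligned word vectors (hypothetical): words from both
# languages that mean the same thing share one latent index.
SHARED = {
    "en": {"cat": 0, "sleeps": 1},
    "fr": {"chat": 0, "dort": 1},
}


def encode(words, lang):
    """Map a sentence in `lang` to a language-independent latent sequence."""
    return [SHARED[lang][w] for w in words]


def decode(latent, lang):
    """Map a latent sequence back to words in `lang`."""
    inverse = {idx: word for word, idx in SHARED[lang].items()}
    return [inverse[i] for i in latent]


def translate(words, src_lang, tgt_lang):
    """Translate by encoding in the source language and decoding in the target."""
    return decode(encode(words, src_lang), tgt_lang)


def discriminate(latent):
    """Guess the language a latent came from.

    In this toy setup the latent carries no language signal, so the best
    possible guess is uniform over the two languages.
    """
    return {"en": 0.5, "fr": 0.5}
```

In this sketch, `translate(["cat", "sleeps"], "en", "fr")` passes through the shared latent sequence, and decoding back into the source language reproduces the input, which is the auto-encoding case.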

Overview of Objective

The objective function is the sum of three terms:

  1. The de-noising auto-encoder loss
  2. The translation loss
  3. The adversarial loss
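
As a minimal sketch, the combination of the three terms above can be written as a single function. The function name `total_loss` and the weights `w_auto`, `w_trans`, and `w_adv` are assumptions added for illustration; with the default weights of 1.0 the result is the plain sum described in this section.

```python
def total_loss(auto_loss, translation_loss, adversarial_loss,
               w_auto=1.0, w_trans=1.0, w_adv=1.0):
    """Combine the three loss terms into one training objective.

    The weights are hypothetical knobs; leaving them at 1.0 gives the
    unweighted sum of the three terms.
    """
    return (w_auto * auto_loss
            + w_trans * translation_loss
            + w_adv * adversarial_loss)
```

For example, `total_loss(0.5, 0.25, 0.25)` adds the three values directly, while passing `w_adv=0.5` would halve the contribution of the adversarial term.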