stat946w18/Unsupervised Machine Translation Using Monolingual Corpora Only

From statwiki
Revision as of 19:38, 17 February 2018 by Pa2forsy (talk | contribs) (Created page with "File:Under-Construction-Sign.gif == Introduction == Neural Machine translation systems must be trained on large corpora consisting of pairs of pre-translated sentences. T...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Introduction

Neural Machine translation systems must be trained on large corpora consisting of pairs of pre-translated sentences. This paper proposes an unsupervised neural machine translation system, which can be trained without using any such parallel data.

Overview of Translation System

The unsupervised translation system has four components:

  1. An unsupervised word-vector alignment system
  2. An encoder
  3. A decoder
  4. A discriminator

Overview of Objective

The objective function is the sum of three terms:

  1. The de-noising auto-encoder loss
  2. The translation loss
  3. The adversarial loss