From Variational to Deterministic Autoencoders
Presented by
Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari, Michael Black, Bernhard Schölkopf
Introduction
This paper presents Regularized Autoencoders (RAEs), a deterministic alternative framework for generative modelling. The authors investigate how the stochasticity of VAEs can be substituted with implicit and explicit regularization schemes. Furthermore, they present a generative mechanism for deterministic autoencoders based on an ex-post density estimation step, which can also be applied to existing VAEs to improve their sample quality. Finally, they conduct an empirical comparison between VAEs and deterministic regularized autoencoders, showing that the latter generate samples of comparable or better quality on images and structured data.
Previous Work
The proposed method modifies the architecture of the existing Variational Autoencoder (VAE) (Kingma & Welling, 2014; Rezende et al., 2014).
Motivation
The authors point to several drawbacks currently associated with VAEs, including:
- over-regularisation induced by the KL divergence term within the objective (Tolstikhin et al., 2017)
- posterior collapse in conjunction with powerful decoders (van den Oord et al., 2017)
- increased variance of gradients caused by approximating expectations through sampling (Burda et al., 2015; Tucker et al., 2017)
These issues motivate their consideration of alternatives to the variational framework adopted by VAEs.
Furthermore, the authors view the VAE's injection of random noise within the reparameterization [math]\displaystyle{ z = \mu(x) +\sigma(x)\epsilon }[/math] as having a regularization effect, whereby it promotes the learning of a smoother latent space. This motivates their exploration of alternative regularization schemes for autoencoders that could be substituted in place of the VAE's random noise injection to produce samples of equivalent or better quality. This would allow for the elimination of the variational framework and its associated drawbacks.
Framework Architecture
Overview
The Regularized Autoencoder makes three modifications to the existing VAE framework. Firstly, it eliminates the injection of random noise [math]\displaystyle{ \epsilon }[/math] from the reparameterization of the latent variable [math]\displaystyle{ z }[/math]. Secondly, it proposes a redesigned loss function [math]\displaystyle{ \mathcal{L}_{RAE} }[/math]. Finally, it proposes an ex-post density estimation procedure for generating samples from the RAE, sketched below.
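Since a deterministic autoencoder has no prior distribution to sample from, the ex-post step fits a simple density estimator to the latent codes of the training set after training, and then draws new latent codes from it for the decoder. The sketch below assumes hypothetical encoder/decoder callables and uses a 10-component Gaussian mixture, one of the estimators considered in the paper; it is a minimal illustration rather than the authors' implementation.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def ex_post_sample(encoder, decoder, train_data, n_samples=16, n_components=10):
    """Ex-post density estimation: fit a density model over the latent codes,
    then draw new codes from it and decode them into generated samples."""
    # Encode the training set into deterministic latent codes z = E_phi(x).
    z = np.stack([encoder(x) for x in train_data])

    # Fit a simple density estimator (here a Gaussian mixture) over the codes.
    density = GaussianMixture(n_components=n_components).fit(z)

    # Sample latent codes from the fitted density and decode them.
    z_new, _ = density.sample(n_samples)
    return np.stack([decoder(z_i) for z_i in z_new])
```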
Eliminating Random Noise
The authors' proposal to eliminate the injection of random noise [math]\displaystyle{ \epsilon }[/math] from the reparameterization of the latent variable [math]\displaystyle{ z = \mu(x) +\sigma(x)\epsilon }[/math] results in an encoder [math]\displaystyle{ E_{\phi} }[/math] that deterministically maps a data point to a latent variable [math]\displaystyle{ z }[/math].
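The contrast between the stochastic and the deterministic encoding can be made concrete with a short sketch. The snippet below is illustrative only: mu_net and sigma_net are hypothetical networks producing the posterior parameters [math]\displaystyle{ \mu(x) }[/math] and [math]\displaystyle{ \sigma(x) }[/math], and the only change on the RAE side is that the noise term is dropped.

```python
import numpy as np

def vae_encode(x, mu_net, sigma_net, rng=None):
    """VAE encoding via the reparameterization trick:
    z = mu(x) + sigma(x) * epsilon, with epsilon ~ N(0, I)."""
    rng = np.random.default_rng() if rng is None else rng
    mu, sigma = mu_net(x), sigma_net(x)
    epsilon = rng.standard_normal(mu.shape)  # injected random noise
    return mu + sigma * epsilon

def rae_encode(x, mu_net):
    """RAE encoding: the noise injection is removed, so the encoder E_phi
    maps each data point deterministically to its latent code z."""
    return mu_net(x)
```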
The variational framework of VAEs enforces regularization on the encoder posterior through the KL-divergence term of its loss function:
\begin{align}
\mathcal{L}_{ELBO} = -\mathbb{E}_{z \sim q_{\phi}(z|x)}\left[\log p_{\theta}(x|z)\right] + \mathrm{KL}(q_{\phi}(z|x) \,\|\, p(z))
\end{align}
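For the common choice of a diagonal Gaussian posterior [math]\displaystyle{ q_{\phi}(z|x) = \mathcal{N}\big(\mu_{\phi}(x), \operatorname{diag}(\sigma_{\phi}^{2}(x))\big) }[/math] and a standard normal prior [math]\displaystyle{ p(z) = \mathcal{N}(0, I) }[/math] (a standard VAE setup, assumed here rather than stated in this section), the KL term has the closed form
\begin{align}
\mathrm{KL}(q_{\phi}(z|x) \,\|\, p(z)) = \frac{1}{2}\sum_{i=1}^{d}\left(\mu_{\phi,i}^{2}(x) + \sigma_{\phi,i}^{2}(x) - \log \sigma_{\phi,i}^{2}(x) - 1\right)
\end{align}
which pulls every posterior mean toward the origin and every posterior variance toward one. This makes explicit how the term regularizes the encoder posterior, and why it can over-regularize.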
Redesigned Loss Function
The redesigned loss function [math]\displaystyle{ \mathcal{L}_{RAE} }[/math] is defined as:
\begin{align}
\mathcal{L}_{RAE} = \mathcal{L}_{REC} + \beta \mathcal{L}^{RAE}_Z + \lambda \mathcal{L}_{REG}
\end{align}
where [math]\displaystyle{ \beta }[/math] and [math]\displaystyle{ \lambda }[/math] are weighting hyperparameters, [math]\displaystyle{ \mathcal{L}^{RAE}_Z }[/math] regularizes the latent codes, and [math]\displaystyle{ \mathcal{L}_{REG} }[/math] is an explicit regularizer applied to the decoder. The reconstruction loss [math]\displaystyle{ \mathcal{L}_{REC} }[/math] is defined as the mean squared error between input samples and their mean reconstructions [math]\displaystyle{ \mu_{\theta} }[/math] produced by a deterministic decoder. In the paper it is formally defined as:
\begin{align}
\mathcal{L}_{REC} = ||\mathbf{x} - \mathbf{\mu_{\theta}}(E_{\phi}(\mathbf{x}))||_2^2
\end{align}
However, as the decoder [math]\displaystyle{ D_{\theta} }[/math] is deterministic, the reconstruction loss is equivalent to:
\begin{align}
\mathcal{L}_{REC} = ||\mathbf{x} - D_{\theta}(E_{\phi}(\mathbf{x}))||_2^2
\end{align}
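A minimal sketch of the full RAE objective follows. The concrete regularizer choices, a [math]\displaystyle{ \tfrac{1}{2}\|z\|_2^2 }[/math] penalty on the latent codes for [math]\displaystyle{ \mathcal{L}^{RAE}_Z }[/math] and weight decay on the decoder for [math]\displaystyle{ \mathcal{L}_{REG} }[/math], follow options discussed in the paper, but the function and parameter names here are illustrative assumptions rather than the authors' reference implementation.

```python
import torch

def rae_loss(x, encoder, decoder, beta, lam):
    """Compute L_RAE = L_REC + beta * L_Z^RAE + lambda * L_REG for a batch x,
    where encoder and decoder are deterministic torch.nn.Modules and
    beta, lam are weighting hyperparameters."""
    z = encoder(x)        # deterministic latent codes z = E_phi(x)
    x_hat = decoder(z)    # deterministic reconstruction D_theta(z)

    # Reconstruction loss: squared L2 error between x and its reconstruction.
    l_rec = (x - x_hat).pow(2).flatten(start_dim=1).sum(dim=1).mean()

    # Latent regularizer L_Z^RAE: penalize the norm of the latent codes.
    l_z = 0.5 * z.pow(2).sum(dim=1).mean()

    # Explicit decoder regularizer L_REG: here, weight decay on theta.
    l_reg = sum(p.pow(2).sum() for p in decoder.parameters())

    return l_rec + beta * l_z + lam * l_reg
```

During training this scalar is minimized jointly over the encoder and decoder parameters with a standard optimizer.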