Dynamic Routing Between Capsules STAT946: Difference between revisions

From statwiki
Jump to navigation Jump to search
Line 14: Line 14:
* We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result.
* We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result.
** A capsule is inspired by a mini-column.
** A capsule is inspired by a mini-column.
== What does a capsule represent? ==
* Each capsule represents the presence and the instantiation parameters of a multi-dimensional entity of the type that the capsule detects.
* In the visual pathway, for example, a capsule detects a particular type of object or object-part.
* A capsule outputs two things:
** 1. The probability that an object of that type is present.
** 2. The generalized pose of the object which includes position, orientation, scale, deformation, velocity, color etc.


= Mathematical Representations =
= Mathematical Representations =

Revision as of 22:10, 1 April 2018

Presented by

Yang, Tong(Richard)

Introduction

Hinton's Critiques on CNN

What is wrong with "standard" neural nets?

  • They have too few levels of structure:
    • Neurons, Layers, Whole Nets
  • We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result.
    • A capsule is inspired by a mini-column.

What does a capsule represent?

  • Each capsule represents the presence and the instantiation parameters of a multi-dimensional entity of the type that the capsule detects.
  • In the visual pathway, for example, a capsule detects a particular type of object or object-part.
  • A capsule outputs two things:
    • 1. The probability that an object of that type is present.
    • 2. The generalized pose of the object which includes position, orientation, scale, deformation, velocity, color etc.

Mathematical Representations

Capsule

\begin{align} s_j = \sum_{i}c_{ij}\hat{u}_{j|i} \end{align}

where

\begin{align} \hat{u}_{j|i} = W_{ij}u_i \end{align}

Two Key Features of Capsule Network

Squashing

\begin{align} v_j = \frac{||s_j||^2}{1+||s_j||^2}\frac{s_j}{||s_j||} \end{align}

Routing By Agreement

\begin{align} c_{ij} = \frac{exp(b_ij)}{\sum_{k}exp(b_ik)} \end{align}

Empirical Results

MINST

MultiMNIST