Dynamic Routing Between Capsules STAT946: Difference between revisions
Jump to navigation
Jump to search
Line 14: | Line 14: | ||
* We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result. | * We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result. | ||
** A capsule is inspired by a mini-column. | ** A capsule is inspired by a mini-column. | ||
== What does a capsule represent? == | |||
* Each capsule represents the presence and the instantiation parameters of a multi-dimensional entity of the type that the capsule detects. | |||
* In the visual pathway, for example, a capsule detects a particular type of object or object-part. | |||
* A capsule outputs two things: | |||
** 1. The probability that an object of that type is present. | |||
** 2. The generalized pose of the object which includes position, orientation, scale, deformation, velocity, color etc. | |||
= Mathematical Representations = | = Mathematical Representations = |
Revision as of 21:10, 1 April 2018
Presented by
Yang, Tong(Richard)
Introduction
Hinton's Critiques on CNN
What is wrong with "standard" neural nets?
- They have too few levels of structure:
- Neurons, Layers, Whole Nets
- We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result.
- A capsule is inspired by a mini-column.
What does a capsule represent?
- Each capsule represents the presence and the instantiation parameters of a multi-dimensional entity of the type that the capsule detects.
- In the visual pathway, for example, a capsule detects a particular type of object or object-part.
- A capsule outputs two things:
- 1. The probability that an object of that type is present.
- 2. The generalized pose of the object which includes position, orientation, scale, deformation, velocity, color etc.
Mathematical Representations
Capsule
\begin{align} s_j = \sum_{i}c_{ij}\hat{u}_{j|i} \end{align}
where
\begin{align} \hat{u}_{j|i} = W_{ij}u_i \end{align}
Two Key Features of Capsule Network
Squashing
\begin{align} v_j = \frac{||s_j||^2}{1+||s_j||^2}\frac{s_j}{||s_j||} \end{align}
Routing By Agreement
\begin{align} c_{ij} = \frac{exp(b_ij)}{\sum_{k}exp(b_ik)} \end{align}