Dynamic Routing Between Capsules STAT946
Presented by
Yang, Tong(Richard)
Introduction
Hinton's Critiques on CNN
= What is wrong with "standard" neural nets?
- They have too few levels of structure:
- Neurons, Layers, Whole Nets
- We need to group neurons in each layer into "capsules" that do a lot of internal computation and then output a compact result.
- A capsule is inspired by a mini-column.
Mathematical Representations
Capsule
\begin{align} s_j = \sum_{i}c_{ij}\hat{u}_{j|i} \end{align}
where
\begin{align} \hat{u}_{j|i} = W_{ij}u_i \end{align}
Two Key Features of Capsule Network
Squashing
\begin{align} v_j = \frac{||s_j||^2}{1+||s_j||^2}\frac{s_j}{||s_j||} \end{align}
Routing By Agreement
\begin{align} c_{ij} = \frac{exp(b_ij)}{\sum_{k}exp(b_ik)} \end{align}