Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Introduction & Background

Learning quickly is a hallmark of human intelligence, whether it involves recognizing objects from a few examples or picking up new skills after just minutes of experience. This work proposes a meta-learning algorithm that is general and model-agnostic, in the sense that it can be applied directly to any learning problem and any model trained with a gradient descent procedure. The focus is on deep neural network models, but the authors illustrate how the approach handles different architectures and different problem settings, including classification, regression, and policy-gradient reinforcement learning, with minimal modification. Unlike prior meta-learning methods that learn an update function or learning rule (Schmidhuber, 1987; Bengio et al., 1992; Andrychowicz et al., 2016; Ravi & Larochelle, 2017), the algorithm neither expands the number of learned parameters nor places constraints on the model architecture (e.g., by requiring a recurrent model (Santoro et al., 2016) or a Siamese network (Koch, 2015)), and it can be readily combined with fully connected, convolutional, or recurrent neural networks. It can also be used with a variety of loss functions, including differentiable supervised losses and non-differentiable reinforcement learning objectives.
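Concretely, the paper formalizes this as a two-level gradient procedure (detailed in the MAML section below). For each task T_i drawn from a task distribution p(T), the parameters θ are first adapted by one or a few gradient steps on that task's loss, and the meta-objective then optimizes performance after adaptation:

\theta_i' = \theta - \alpha \nabla_\theta \mathcal{L}_{T_i}(f_\theta)

\theta \leftarrow \theta - \beta \nabla_\theta \sum_{T_i \sim p(\mathcal{T})} \mathcal{L}_{T_i}(f_{\theta_i'})

Here α and β are the inner (adaptation) and outer (meta) step sizes, and \mathcal{L}_{T_i} is the loss on task T_i. Note that the meta-update differentiates through the adaptation step, so it involves gradients of gradients.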

The primary contribution of this work is a simple model- and task-agnostic meta-learning algorithm that trains a model's parameters such that a small number of gradient updates leads to fast learning on a new task. The paper demonstrates the effectiveness of the proposed algorithm in several domains, including classification, regression, and reinforcement learning problems.
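To make the procedure concrete, the following is a minimal sketch of one MAML meta-step in JAX for a toy regression setting, similar in spirit to the paper's sine-wave experiments. It is an illustration under assumptions, not the authors' implementation: the small MLP, the squared-error loss, and the names inner_lr, meta_lr, and task_batch are all hypothetical choices.

# Minimal MAML meta-step sketch in JAX (illustrative; hyperparameters
# and model shapes are hypothetical, not from the paper).
import jax
import jax.numpy as jnp

def predict(params, x):
    # A tiny MLP f_theta with one tanh hidden layer, standing in for
    # any model trainable by gradient descent.
    w1, b1, w2, b2 = params
    h = jnp.tanh(x @ w1 + b1)
    return h @ w2 + b2

def loss(params, x, y):
    # Squared-error task loss L_{T_i}(f_theta).
    return jnp.mean((predict(params, x) - y) ** 2)

def inner_update(params, x, y, inner_lr=0.01):
    # One adaptation step: theta'_i = theta - alpha * grad L_{T_i}(f_theta).
    grads = jax.grad(loss)(params, x, y)
    return [p - inner_lr * g for p, g in zip(params, grads)]

def maml_loss(params, x_train, y_train, x_test, y_test):
    # Post-adaptation loss L_{T_i}(f_{theta'_i}) on held-out task data.
    adapted = inner_update(params, x_train, y_train)
    return loss(adapted, x_test, y_test)

@jax.jit
def meta_step(params, task_batch, meta_lr=0.001):
    # Meta-update: average the post-adaptation loss over a batch of
    # tasks, then take a gradient step on theta itself.
    def batch_loss(p):
        return jnp.mean(jax.vmap(lambda task: maml_loss(p, *task))(task_batch))
    grads = jax.grad(batch_loss)(params)
    return [p - meta_lr * g for p, g in zip(params, grads)]

# Hypothetical initial parameters for a 1-40-1 regression network.
k1, k2 = jax.random.split(jax.random.PRNGKey(0))
params = [0.1 * jax.random.normal(k1, (1, 40)), jnp.zeros(40),
          0.1 * jax.random.normal(k2, (40, 1)), jnp.zeros(1)]

Because maml_loss is an ordinary differentiable function of params, jax.grad differentiates through the inner update, so the second-order meta-gradient MAML requires comes for free; task_batch would be a tuple of arrays (x_train, y_train, x_test, y_test), each with a leading task dimension.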

Model-Agnostic Meta-Learning (MAML)

Different Types of MAML

Experiments

Conclusion & Critique

References

  1. Schmidhuber, Jürgen. Learning to control fast-weight memories: An alternative to dynamic recurrent networks. Neural Computation, 1992.