Revision as of 01:51, 17 November 2018

Group Member

Yu Xuan Lee, Tsen Yee Heng

Background and Introduction

Supervised learning consist of a training set [math]\displaystyle{ D_{tx}=(x_i,y_i)^n_{i=1} }[/math], where [math]\displaystyle{ x_i \in X }[/math] and [math]\displaystyle{ y_i \in Y }[/math]. Empirical Risk Minimization (ERM) is one of the common supervised learning method used to minimize a loss function by having multiple passes over the training set.

[math]\displaystyle{ \frac{1}{|D_{tr}|}\textstyle \sum_{x_i,y_i} \in D_{tr} \ell (f(x_i),y_i) }[/math]

where [math]\displaystyle{ \ell :\mathcal {Y} \times \mathcal {Y} \to [0, \infty) }[/math]

Different to machine learning, datas are being observed sequentially, occurred recurrently, and stored limitedly for learning humans. Thus, the iid assumption is not applicable to ERM. One of the characteristics of ERM is "catastrophic forgetting", which is the problem of recalling past knowledge upon acquiring new ones. To overcome this problem, Gradient Episodic Memory (GEM) is introduced to alleviates forgetting on previous acquired knowledge, while solving new problems more efficiently.

Framework for Continual Learning

Revision as of 01:48, 17 November 2018 (view source) Yxlee (talk \| contribs) No edit summary ← Older edit		Revision as of 01:51, 17 November 2018 (view source) Yxlee (talk \| contribs) No edit summary Newer edit →
Line 18:		Line 18:

	= Framework for Continual Learning =		= Framework for Continual Learning =

	== Task Descriptor ==		== Task Descriptor ==

Gradient Episodic Memory for Continual Learning: Difference between revisions

Revision as of 01:51, 17 November 2018

Contents

Group Member

Background and Introduction

Framework for Continual Learning

Task Descriptor

Navigation menu

Gradient Episodic Memory for Continual Learning: Difference between revisions

Revision as of 01:51, 17 November 2018

Group Member

Background and Introduction

Framework for Continual Learning

Task Descriptor

Navigation menu

Search