当前位置:网站首页>Continuous learning, lifelong learning, episodic memory, memory module paper summary -- gradient episodic memory promotes continuous learning
Continuous learning, lifelong learning, episodic memory, memory module paper summary -- gradient episodic memory promotes continuous learning
2022-07-24 07:50:00 【Ahang 626】
Gradient Episodic Memory for Continual Learning
Gradient episodic memory promotes continuous learning
1. Related information :
- Company :Facebook AI research
- Time :2017
- Code :github Mirror address
- The paper :csdn Download address
1. Paper objectives :
For a large number of tasks 、 Continuous learning of a small number of samples , stay MNIST and CIFAR-100 Experiments on data sets
2. Important details :
- Presupposition : Within each task , Data is independent and distributed
- The data shows : A triple : Input , Task identifier 、 Output
- Forward migration : Learn a new task , Impact on future learning follow-up tasks
- Backward migration : Learn a new task , Impact on previously learned tasks
- Episodic memory module : Store data pairs for past tasks
- Continuous learning realizes : All tasks share parameters , Ensure that when learning new tasks and updating parameters , The loss of past task data in memory does not increase , So as to solve catastrophic forgetting
- Measure loss changes : Measured by the included angle of gradient direction , Cosine value is greater than or equal to 0 In the same direction , It will not affect the performance of past tasks , Have good backward migration ability
- Gradient update direction correction : The selection is closest to the original parameter update ( use L2 Norm measures distance ), And meet the projection of measuring loss change conditions
- Quadratic programming and its dual problem
3. Inspire and think :
- Add task descriptor : More detailed task description . Measure the similarity between different tasks through task descriptors , Infer the relationship between tasks , Clear the ambiguity between similar tasks
- High similarity : Learning new tasks can speed up learning , Even realize 0 Second study
- Low similarity : Focus on backward migration , Measure loss changes , It is not necessary to measure the loss change of tasks with high similarity
- Advanced memory management : Build a core set of past tasks
边栏推荐
- Digital twin demonstration project -- Talking about simple pendulum (3) solid model exploration
- CNN-VINS
- Implement a queue with two stacks.
- 觉维设计响应式布局
- The difference between session and cookie
- Example of dictionary
- Do you want to have a robot that can make cartoon avatars in three steps?
- 【Pytorch】conv2d torchvision.transforms
- About the solution of thinking that you download torch as a GPU version, but the result is really a CPU version
- Eight part essay on software testing
猜你喜欢

Postman extracts the token parameter value in the response header and sets it as an environment variable, with code attached

Advanced part of C language VI. file operation

Implement a queue with two stacks.

About the solution of thinking that you download torch as a GPU version, but the result is really a CPU version

Simple Gateway - intranet server safely obtains external network data

Robot operation continuous learning thesis (1) original text reading and Translation -- primitive generation strategy learning without catastrophic forgetting in robot operation

Hcip day 8 notes

Advanced part of C language I. data storage
![2022-07-23: given n items, each item has weight (w[i]) and value (v[i]), only two items can be selected at most, and the weight does not exceed bag. What is the maximum return value? N <= 10^5, w[i] <](/img/f4/ba2706e93f042dd8b110fac0d873c8.png)
2022-07-23: given n items, each item has weight (w[i]) and value (v[i]), only two items can be selected at most, and the weight does not exceed bag. What is the maximum return value? N <= 10^5, w[i] <

Image feature Harris corner detection
随机推荐
OpenGL camera and periodic review
XSS vulnerability learning
requests-爬虫多页爬取肯德基餐厅位置
Solve the problem that Anaconda navigator cannot be opened
Thesis reading: geotransformer
China trichlorosilane Market Forecast and Strategic Research Report (2022 Edition)
简易网闸-内网服务器安全获取外网数据
Intelligent robots and intelligent systems (Professor Zheng Zheng of Dalian University of Technology) -- 1. robots and mobile robots
Talk about compilers based on vscode
MySQL -- subquery scalar subquery
Case practice - panoramic image mosaic: feature matching method
Do you know the use of string?
Selenium basic knowledge paging processing
UNI-APP_ Playback and pause of background music of applet or H5 page
Multiple optimization methods print prime numbers between 100 and 200
Implement a queue with two stacks.
Installation and use of Zen path & defect report & defect operation
Selenium basic knowledge debugging method
Advanced part of C language IV. detailed explanation of user-defined types
【Pytorch】Dataset_ DataLoader