[One-page summary] Understanding plasticity in neural networks (arxiv 2023) by Lyle et al.

Elune001 2024. 1. 15. 21:29

● Summary: stabilizing the loss landscape is crucial for preserving plasticity

 

● Approach highlight

  • They show that abrupt task changes can make the optimizer unstable and thereby drive plasticity loss

Adam optimizer

When the loss changes suddenly, the first-moment estimate $\hat{m}_{t}$ adapts much faster than the second-moment estimate $\hat{v}_{t}$, which makes the update $\hat{u}_{t}$ unstable. A simple fix is to increase $\epsilon$.
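
For reference, this is the standard Adam update (Kingma & Ba, 2015), not notation introduced by the paper beyond the symbols used above; with gradient $g_{t}$ and decay rates $\beta_{1} < \beta_{2}$:

$$m_{t} = \beta_{1} m_{t-1} + (1-\beta_{1})\, g_{t}, \qquad v_{t} = \beta_{2} v_{t-1} + (1-\beta_{2})\, g_{t}^{2}$$

$$\hat{m}_{t} = \frac{m_{t}}{1-\beta_{1}^{t}}, \qquad \hat{v}_{t} = \frac{v_{t}}{1-\beta_{2}^{t}}, \qquad \hat{u}_{t} = \frac{\hat{m}_{t}}{\sqrt{\hat{v}_{t}} + \epsilon}$$

Because $\beta_{1}$ (typically 0.9) is much smaller than $\beta_{2}$ (typically 0.999), $\hat{m}_{t}$ reacts to a sudden gradient shift long before $\hat{v}_{t}$ does, so $\hat{u}_{t}$ can spike; a larger $\epsilon$ limits the size of the update while $\hat{v}_{t}$ still reflects the previous task's gradient scale.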

  • To understand how the choice of optimization method affects plasticity loss, they compare gradient descent against a random walk (Gaussian perturbation) in parameter space (see the sketch after this list)

  • A smoother loss landscape is both easier to optimize and has been empirically observed to generalize better
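
The sketch below is mine, not the authors' code: it only contrasts the two update rules on a toy quadratic (a gradient-descent step versus a Gaussian perturbation whose norm is matched to the gradient step). In the paper, the corresponding comparison is run on neural networks, and plasticity is then measured by how well each network can still fit a new task.

```python
# Minimal sketch (illustrative names, toy objective): contrast a gradient-descent
# update with a random-walk (Gaussian perturbation) update of matched step size.
import numpy as np

rng = np.random.default_rng(0)

def loss_and_grad(theta, target):
    """Quadratic loss ||theta - target||^2 and its gradient."""
    diff = theta - target
    return float(diff @ diff), 2.0 * diff

theta_gd = rng.normal(size=10)   # parameters updated by gradient descent
theta_rw = theta_gd.copy()       # parameters updated by a random walk
target = rng.normal(size=10)     # stands in for the current task
lr = 0.05

for step in range(200):
    # Gradient-descent update: move along the negative gradient.
    _, g = loss_and_grad(theta_gd, target)
    gd_step = -lr * g
    theta_gd += gd_step

    # Random-walk update: Gaussian perturbation with the same step norm,
    # so both trajectories move through parameter space at the same rate.
    noise = rng.normal(size=theta_rw.shape)
    theta_rw += np.linalg.norm(gd_step) * noise / np.linalg.norm(noise)

print("final GD loss:", loss_and_grad(theta_gd, target)[0])
print("final random-walk loss:", loss_and_grad(theta_rw, target)[0])
```

Matching the step norm is one simple way to make the two trajectories comparable: the random walk serves as a control that moves through parameter space without following the loss's geometry.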

●  Main results

(Figure) Effect of architectural and optimization interventions on plasticity loss
(Figure) Visualization of the relationship between network width and plasticity loss

● Discussion

  • Why does a smoother loss landscape lead to better generalization performance?