[One-page summary] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation by Wu et al.

Computer Vision , AI

Paper_review[short]

Elune001 2024. 1. 16. 00:27

● Summary: A text-to-video (T2V) generation model built by one-shot tuning of a pretrained text-to-image (T2I) diffusion model

● Approach highlight

Sparse spatio-temporal attention for efficiency: each frame attends only to selected earlier frames (the first frame and the immediately preceding frame) rather than to every frame in the video
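
The attention pattern above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `sparse_causal_attention`, the single-head layout, and the random projection matrices `w_q`, `w_k`, `w_v` are assumptions made for illustration. Queries come from the current frame, while keys and values come only from the first frame and the previous frame, so each frame attends over 2N tokens instead of all T·N tokens.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sparse_causal_attention(frames, w_q, w_k, w_v):
    """frames: (T, N, D) array, T frames with N tokens of width D each."""
    outputs = []
    for i in range(frames.shape[0]):
        prev = frames[max(i - 1, 0)]  # frame 0 falls back to itself
        # Keys/values are built from the first and the previous frame only.
        context = np.concatenate([frames[0], prev], axis=0)  # (2N, D)
        q = frames[i] @ w_q                                   # (N, D)
        k = context @ w_k                                     # (2N, D)
        v = context @ w_v                                     # (2N, D)
        weights = softmax(q @ k.T / np.sqrt(q.shape[-1]))     # (N, 2N)
        outputs.append(weights @ v)
    return np.stack(outputs)                                  # (T, N, D)

rng = np.random.default_rng(0)
T, N, D = 4, 6, 8  # hypothetical sizes
frames = rng.normal(size=(T, N, D))
w_q, w_k, w_v = (rng.normal(size=(D, D)) for _ in range(3))
out = sparse_causal_attention(frames, w_q, w_k, w_v)
print(out.shape)  # (4, 6, 8)
```

In this sketch, perturbing frame 1 leaves the output for frame 3 unchanged, because frame 3 only looks at frames 0 and 2.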

T2V generation by fine-tuning a T2I model: only the attention blocks are updated during the fine-tuning stage; the remaining weights stay frozen
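
A rough sketch of the "update only the attention blocks" idea, in plain Python. The parameter names and the `"attn"` keyword are hypothetical and only stand in for whatever naming the actual model uses; the helper `select_trainable` is likewise an illustrative assumption.

```python
def select_trainable(param_names, keyword="attn"):
    """Map each parameter name to True if it should be updated during fine-tuning."""
    return {name: keyword in name for name in param_names}

# Hypothetical parameter names for a small U-Net-like model.
names = [
    "down.0.resnet.conv1.weight",
    "down.0.attn1.to_q.weight",
    "down.0.attn_temp.to_q.weight",
    "mid.resnet.conv2.weight",
]
flags = select_trainable(names)
trainable = [n for n, is_trainable in flags.items() if is_trainable]
print(trainable)
# ['down.0.attn1.to_q.weight', 'down.0.attn_temp.to_q.weight']

# With a PyTorch module, the same flags could be applied as:
#   for name, p in model.named_parameters():
#       p.requires_grad = flags[name]
```

Everything outside the attention blocks keeps its pretrained T2I weights, which is what makes tuning on a single video feasible.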

● Main Results

● Discussion

Limited ability to represent interactions among multiple objects, owing to limitations of the underlying diffusion model
