[One-page summary] Zero1 to 3: Zero shot One Image to 3D Object by Liu et al.

Processing math: 100%

본문 바로가기

Notice

Recent Posts

Recent Comments

Link

Tags more

Archives

Today

Total

관리 메뉴

Computer Vision , AI

[One-page summary] Zero1 to 3: Zero shot One Image to 3D Object by Liu et al. 본문

Paper_review[short]

[One-page summary] Zero1 to 3: Zero shot One Image to 3D Object by Liu et al.

Elune001 2024. 1. 16. 00:24

● Summary:Diffusion model for NeRF

● Approach highlight

Viewpoint-conditioned translation image translation model using a conditional latent diffusion model $\hat{X}_{R,T}=f(x,R,T)$

Score Jacobian Chaining (SJC) for 3d representation:
1. randomly sample viewpoints
2. perform volumetric rendering
3. perturb the resulting images with Gaussian noise ϵ
4. denoise them by applying the Unet $ϵ_{θ}$ conditioned on the input image, posed CLIP embedding and timestep

● Main Results

● Discussion

In fig6 the model doesn't seem to work well with multiple objects. I think that the reason the viewpoint synthesis diffusion model is trained on a single object. (Domain shift problem of diffusion model)

'Paper_review[short]' 카테고리의 다른 글

[One-page summary] TextTo 4D Dynamic Scene Generation by Singer et al. (0)	2024.01.16
[One-page summary] TuneA Video: One Shot Tuning of Image Diffusion Models for Text to Video Generation by Wu et al. (0)	2024.01.16
[One-page summary] Monocular Depth Estimation using Diffusion Models by Saxena et al. (0)	2024.01.16
[One-page summary] NerfDiff: Single image View Synthesis with NeRF guided Distillation from 3D aware Diffusion by Gu et al. (0)	2024.01.16
[One-page summary] Signal Processing for Implicit Neural Representations (NeurIPS 2022) by Xu et al. (0)	2024.01.15

'Paper_review[short]' Related Articles

more

티스토리툴바