Intrinsic Image Diffusion for Single-view Material Estimation

  Рет қаралды 1,746

Matthias Niessner

Matthias Niessner

Күн бұрын

Project: peter-kocsis.g...
Paper: arxiv.org/abs/...
We present Intrinsic Image Diffusion, a generative model for appearance decomposition of indoor scenes. Given a single input view, we sample multiple possible material explanations represented as albedo, roughness, and metallic maps. Appearance decomposition poses a considerable challenge in computer vision due to the inherent ambiguity between lighting and material properties and the lack of real datasets. To address this issue, we advocate for a probabilistic formulation, where instead of attempting to directly predict the true material properties, we employ a conditional generative model to sample from the solution space. Furthermore, we show that utilizing the strong learned prior of recent diffusion models trained on large-scale real-world images can be adapted to material estimation and highly improves the generalization to real images. Our method produces significantly sharper, more consistent, and more detailed materials, outperforming state-of-the-art methods by 1.5dB on PSNR and by 45% better FID score on albedo prediction. We demonstrate the effectiveness of our approach through experiments on both synthetic and real-world datasets.

Пікірлер: 2
@Eric.Tomaszewski
@Eric.Tomaszewski 9 ай бұрын
Looks amazing. Waiting for the code in your github to try myself. Thanks!
@manu.vision
@manu.vision 7 ай бұрын
😮
Stable Diffusion in Code (AI Image Generation) - Computerphile
16:56
Computerphile
Рет қаралды 293 М.
Miika Aittala: Elucidating the Design Space of Diffusion-Based Generative Models
52:46
Finnish Center for Artificial Intelligence FCAI
Рет қаралды 12 М.
How do Cats Eat Watermelon? 🍉
00:21
One More
Рет қаралды 12 МЛН
Как мы играем в игры 😂
00:20
МЯТНАЯ ФАНТА
Рет қаралды 3,4 МЛН
Win This Dodgeball Game or DIE…
00:36
Alan Chikin Chow
Рет қаралды 44 МЛН
MonoNPHM: Dynamic Head Reconstruction from Monocular Videos
3:48
Matthias Niessner
Рет қаралды 2,8 М.
GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians
2:40
How Stable Diffusion Works (AI Image Generation)
30:21
Gonkee
Рет қаралды 151 М.
BLOOM GOLD: My Truly Stunning Diffusion Filters
8:50
Philip Bloom
Рет қаралды 37 М.
DPHMs: Diffusion Parametric Head Models for Depth-based Tracking
3:32
Matthias Niessner
Рет қаралды 1,5 М.
What are Diffusion Models?
15:28
Ari Seff
Рет қаралды 223 М.
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
6:05
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
ViewDiff: 3D-Consistent Image Generation with Text-To-Image Models
3:07
Matthias Niessner
Рет қаралды 3,1 М.