Is the lighting baked in like in the original paper or is it possible to relight the generated heads?
@spider2544 · 13 days ago
Can't relight what you didn't capture in the dataset.
@jonasmayer5624 · 17 days ago
Incredible work! Also: Levenberg-Marquardt and Gaussian splats... this has to be the most Matthias Niessner thing I've ever seen! :D
@RaspiAudio · 19 days ago
Looks great but when do you plan to release your source code?
@mattanimation · 21 days ago
awesome!
@0609Bhuwan · 22 days ago
Wow, this is great work... Congratulations to the team!!! Are we going to get to use this, or is it only for use by Synthesia?
@김대현-t8c4k · 2 months ago
Hello, I appreciate your outstanding research. I have a question: how do I create a scene with texture using a .obj file and an RGB texture image with a resolution of 4096 × 4096?
@floribertjackalope2606 · 2 months ago
Is it intended that the course site was taken down?
@onurcanisler · 2 months ago
Oh dear god. Surely the hardest lecture of all.
@mrburns366 · 3 months ago
So.. LA Noire 2? 😁
@AaliDGr8 · 3 months ago
How do I use it? Why didn't you give us a working link??? Please.
@weelianglien687 · 3 months ago
I wonder, in the AlexNet example (e.g. the 1st conv layer), should the kernels be labelled as 11x11x3 due to the RGB input? Unless the use of blue-coloured layers in all the stages signifies that this is only an illustration for the B channel?
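The commenter's point is right: convolution filters span the full input depth, so AlexNet's first-layer kernels are 11×11×3. A quick sanity check of the parameter arithmetic (96 first-layer filters, as in the original AlexNet paper):

```python
# Conv filters extend through the entire input depth, so AlexNet's
# first-layer kernels are 11x11x3 (RGB input), not 11x11.
k, in_channels, n_filters = 11, 3, 96
params = (k * k * in_channels + 1) * n_filters  # +1 bias per filter
print(params)  # 34944
```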
@ONDANOTA · 3 months ago
Thanks! Very informative.
@alex23361 · 3 months ago
fantastic
@OneOneTwo3 · 4 months ago
My neurons do tend to dropout during tests.
@shishen5253 · 4 months ago
Very impressive work! Will the code for this paper be open source?
@AR-vb4xy · 4 months ago
Very interesting video, but I suggest an improvement with respect to the display of the lecture: the block on the lower right where the recording of the professor is should be very small or removed altogether.
@adriangalvez798 · a month ago
Yes please, sometimes it overlaps with the text/notation 🙏🙏🙏
@florisvanderhout2675 · 12 days ago
Slides are in the description
@interfect · 4 months ago
Thank you for the great lectures! Is it somehow possible to get the slides for parts 8-12?
@dddd-wf6fn · 4 months ago
If there are many tens of thousands of 3D models with semantic labels, could they be used as training data? After loading them with three.js, could the training data be exported automatically?
@MisterWealth · 4 months ago
When will the code be made available?
@shan_420 · 5 months ago
24:00 I think it's a bit confusing with f = Wx while in the image it's xW = f, especially when talking about the dimensionality of W.
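For what it's worth, the two conventions describe the same linear map and differ only in the shape of W: with column vectors, f = Wx needs W of shape (d_out, d_in); with row vectors, f = xW needs the transpose. A small numpy sketch with arbitrary dimensions:

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out = 4, 3
x = rng.standard_normal(d_in)

# Column-vector convention: f = W x, W has shape (d_out, d_in).
W = rng.standard_normal((d_out, d_in))
f_col = W @ x

# Row-vector convention: f = x W', where W' = W.T has shape (d_in, d_out).
f_row = x @ W.T

assert np.allclose(f_col, f_row)  # same result, transposed weight matrix
print(f_col.shape)  # (3,)
```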
@bakikucukcakiroglu · 5 months ago
Using the test data over and over again effectively turns it into a second validation set, so doing that can be considered skipping the test phase.
@rallyworld3417 · 5 months ago
Impressive
@kasemir0 · 5 months ago
Buddy, how can I use this code on Colab? Please help me. Thanks.
@M_a_t_z_ee · 5 months ago
Great introductory lecture. I'm excited about the following ones as well as the programming exercises. 😀
@yimloo60 · 5 months ago
Thank you! Deep learning! That's the first time I've recognized that I have a goddamn brain! :)
@eskimo2616 · 6 months ago
19:11 What does "clamp it to zero" mean?
@ulascanzorer · 5 months ago
I think it just means we set zero as our minimum so that the values can't go lower than that.
@M_a_t_z_ee · 5 months ago
It means that you take two arguments for the maximum function: 0 and the function of the previous inputs. This translates to the ReLU (rectified linear unit) activation function. If the second argument is bigger than 0, the max function evaluates to that argument. If the second argument is negative, the max function evaluates to 0. It "clamps" all negative outputs from the previous layer to 0.
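That clamping behaviour is a one-liner in numpy; a minimal sketch:

```python
import numpy as np

def relu(x):
    # "Clamp to zero": max(0, x) keeps non-negative values unchanged
    # and replaces every negative value with 0.
    return np.maximum(0, x)

print(relu(np.array([-2.0, -0.5, 0.0, 1.5])))  # [0.  0.  0.  1.5]
```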
@DaveDFX · 6 months ago
Amazing! A game changer for avatars.
@PakkaponPhongtawee · 6 months ago
Could you please upload the supplementary material to the website? The paper mentions that relighting results can be found at relighting_results.html, and I want to look into how well it can be relit. Thank you.
@georgetang50 · 6 months ago
Please release the code
@bilalse6862 · 6 months ago
Very impressive paper, thank you guys!
@Copa20777 · 7 months ago
This is great..❤
@wolpumba4099 · 7 months ago
*ELI5 Abstract*

*Imagine you have a magic drawing machine!* This machine can understand words and make pictures with just a description. But sometimes, the pictures only show the object from one side, like a flat drawing.

*We made the machine even better!* Now it can learn from real photos of objects. We taught it how to make a 3D picture inside its head, so you can see the object from any side, as if it were real!

*Our pictures are super cool!* We can tell the machine "a red bouncy ball" or "a fluffy brown dog," and it makes a picture that looks just like the real thing. You can even spin the picture around to see it from all angles. It's like magic!

*Abstract*

Recent advances in text-guided 2D image generation have spurred interest in 3D asset generation. However, existing text-to-3D methods often produce non-photorealistic objects lacking realistic backgrounds. In this work, we present ViewDiff, a method that leverages the power of pretrained text-to-image diffusion models to generate 3D-consistent images from real-world data. Our key innovation lies in integrating 3D volume rendering and cross-frame attention layers into a U-Net architecture. This enables our model to generate views of an object from any viewpoint in an autoregressive manner. Trained on real-world object datasets, ViewDiff produces instances with diverse, high-quality shapes, textures, and realistic backgrounds. Our results demonstrate superior visual quality and consistency compared to existing methods, as measured by FID and KID scores.

disclaimer: i used gemini
@faruknane · 7 months ago
Great work!
@dangthanhtuanit · 7 months ago
I wonder if there is an actual implementation, since the code is not available?
@adrianstarfinger5721 · 7 months ago
Impressive work!
@briancunning423 · 7 months ago
Amazing. Have you tried feeding the images into photogrammetry software or Gaussian splatting software to test the consistency of the 3D?
@lukashollein3985 · 7 months ago
Yes, this works! You can check Figures 17 to 19 in the paper: lukashoel.github.io/ViewDiff/static/viewdiff_paper.pdf
@manu.vision · 7 months ago
😮
@couragefox · 7 months ago
Really want to try it. Please let us know if the code will be released...
@stevenlk · 7 months ago
wow that’s impressive
@ritikkothari2787 · 7 months ago
For calculating the parameters of a particular layer, we do consider the size of the kernel (including depth), which I guess the professor missed! For example in VGG: (3×3×3 + 1) × 64 for the first layer and (3×3×64 + 1) × 64 for the second layer.
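The commenter's numbers check out; a tiny helper (the function name is mine, not from the lecture) reproduces them:

```python
def conv_params(k, in_ch, out_ch):
    # Each filter covers k x k x in_ch weights plus one bias term.
    return (k * k * in_ch + 1) * out_ch

print(conv_params(3, 3, 64))   # 1792  (VGG conv1_1: 3x3x3 kernels, 64 filters)
print(conv_params(3, 64, 64))  # 36928 (VGG conv1_2: 3x3x64 kernels, 64 filters)
```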
@adrianstarfinger5721 · 7 months ago
45:17 Here you talk about neural rendering, e.g. NeRF, which uses an MLP, and you call it an implicit function. However, in the first lecture we learned that MLPs are actually not implicit functions.