AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion (CVPR 2024)

  Рет қаралды 1,218

Michael Black

Michael Black

Күн бұрын

Пікірлер: 4
@LucasMatuszewski
@LucasMatuszewski 4 ай бұрын
Do you know if it works live? Is inference fast enough to be used for a live 3D AI Avatar? Could you recommend other models usable for this purpose? I'm working with Nvidia Audio2Face currently, the best I've found so far.
@voxyloids8723
@voxyloids8723 3 ай бұрын
Don't think so
@youtou252
@youtou252 7 ай бұрын
this just looks like random hand motion tbh
@LucasMatuszewski
@LucasMatuszewski 4 ай бұрын
Yeah, same with Nvidia Omniverse audio2gesture, looks too random to be usable, but its beter every year, so we are close to usable 3d audio to animation. Nvidia audio2face is already quite good.
Jensen Huang Exposed Deepseek: NVDIA Will Soar 80% | NVDA Stock  Latest News
28:42
Meshcapade avatars in CLO3D
1:07
Meshcapade
Рет қаралды 4,6 М.
Каха и дочка
00:28
К-Media
Рет қаралды 3,4 МЛН
Panoptic Lifting for 3D Scene Understanding with Neural Fields (CVPR'2023)
4:47
China's slaughterbots show WW3 would kill us all.
14:46
Digital Engine
Рет қаралды 1,5 МЛН
[CVPR'24 Best Demo Award] Gaussian Splatting SLAM
7:28
Dyson Robotics Laboratory at Imperial College
Рет қаралды 29 М.
DynamicFusion
6:31
Richard Newcombe
Рет қаралды 65 М.
4 essential body language tips from a world champion public speaker
2:28
Business Insider
Рет қаралды 3,3 МЛН