Infinite 3D Landmarks: Improving Continuous 2D FacialLandmark Detection

  Рет қаралды 3,761

DisneyResearchHub

DisneyResearchHub

Ай бұрын

Abstract
In this paper, we examine 3 important issues in the practical use of state-of-the-art facial landmark detectors and show how
a combination of specific architectural modifications can directly improve their accuracy and temporal stability. First, many
facial landmark detectors require a face normalization step as a preprocess, often accomplished by a separately-trained neural
network that crops and resizes the face in the input image. There is no guarantee that this pre-trained network performs optimal
face normalization for the task of landmark detection. Thus, we instead analyze the use of a spatial transformer network that
is trained alongside the landmark detector in an unsupervised manner, jointly learning an optimal face normalization and
landmark detection by a single neural network. Second, we show that modifying the output head of the landmark predictor to
infer landmarks in a canonical 3D space rather than directly in 2D can further improve accuracy. To convert the predicted
3D landmarks into screen-space, we additionally predict the camera intrinsics and head pose from the input image. As a side
benefit, this allows to predict the 3D face shape from a given image only using 2D landmarks as supervision, which is useful
in determining landmark visibility among other things. Third, when training a landmark detector on multiple datasets at the
same time, annotation inconsistencies across datasets forces the network to produce a suboptimal average. We propose to add a
semantic correction network to address this issue. This additional lightweight neural network is trained alongside the landmark
detector, without requiring any additional supervision. While the insights of this paper can be applied to most common landmark
detectors, we specifically target a recently-proposed continuous 2D landmark detector to demonstrate how each of our additions
leads to meaningful improvements over the state-of-the-art on standard benchmarks.
Link to publication: studios.disneyresearch.com/20...

Пікірлер
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 251 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 810 М.
Alex hid in the closet #shorts
00:14
Mihdens
Рет қаралды 15 МЛН
New model rc bird unboxing and testing
00:10
Ruhul Shorts
Рет қаралды 28 МЛН
Heartwarming Unity at School Event #shorts
00:19
Fabiosa Stories
Рет қаралды 23 МЛН
Jumping off balcony pulls her tooth! 🫣🦷
01:00
Justin Flom
Рет қаралды 13 МЛН
Learning a Generalized Physical Face Model From Data Video
6:39
DisneyResearchHub
Рет қаралды 2,8 М.
Design and Control of a Bipedal Robotic Character
9:25
DisneyResearchHub
Рет қаралды 54 М.
GEM: Gaussian Eigen Models for Human Heads
5:44
Justus Thies
Рет қаралды 1,6 М.
How AI Discovered a Faster Matrix Multiplication Algorithm
13:00
Quanta Magazine
Рет қаралды 1,4 МЛН
Stylize My Wrinkles: Bridging the Gap from Simulation to Reality
7:14
DisneyResearchHub
Рет қаралды 1,7 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Andrew Ng Machine Learning Career Advice
10:02
Jared Beckwith, R. EEG T.
Рет қаралды 94 М.
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 906 М.
10 weird algorithms
9:06
Fireship
Рет қаралды 1,2 МЛН
1$ vs 500$ ВИРТУАЛЬНАЯ РЕАЛЬНОСТЬ !
23:20
GoldenBurst
Рет қаралды 1,9 МЛН
Это Xiaomi Su7 Max 🤯 #xiaomi #su7max
1:01
Tynalieff Shorts
Рет қаралды 2,1 МЛН
Look, this is the 97th generation of the phone?
0:13
Edcers
Рет қаралды 7 МЛН
iPhone socket cleaning #Fixit
0:30
Tamar DB (mt)
Рет қаралды 17 МЛН