[NeurIPS 2023] SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding

  Рет қаралды 5,467

Paul-Edouard Sarlin

Paul-Edouard Sarlin

Күн бұрын

Пікірлер: 8
@Unique-Concepts
@Unique-Concepts Жыл бұрын
Fantastic work....Love it👏👏👏🙏🙏👌👌👌👍👍👍
@RicanSamurai
@RicanSamurai Жыл бұрын
Fascinating! Very interesting novel approach to this problem. At 6:39, it appears as though you have ~12 map images that cover the area of interest (of which you highlight four), and then you are able to successfully get a position prediction from a query image. Do you have a sense of how densely that area needs to be covered by your map images before SNAP beats other models? Similarly, is there a map image density at which you see diminishing returns? I'm just curious how many training images are necessary to cover a given region before SNAP's predictions become useful. For that same region in your example, would 50 map images of the region make a meaningful difference to the prediction? Thanks!
@pesarlin
@pesarlin Жыл бұрын
We use a rig with 3 cameras so we actually have 36 images in these examples (each triangle is a camera pose). We have an ablation study in Table 1 in the paper: aerial-only is a bit worse than semantic maps, while StreetView-only is a bit worse than aerial+StreetView. So aerial-only can already get you quite far but having some coverage of ground-level images is important. During training we actually map with fewer images (20 instead of 36) so the model is pretty robust to sparse views, but indeed more is better. I don't have numbers at hand, but I guess that the performance is already quite saturated at 36 views (0.6 per meter), unless there is strong occlusion (e.g. from trucks) in most views.
@HiwotAmlaku
@HiwotAmlaku 11 ай бұрын
Very impressive work! Question: Can I generate a neural map for localization only from birds eye view? Let us say using images from a downward looking camera for a flight from Brussels to Amsterdam.
@mlachahesaidsalimo9958
@mlachahesaidsalimo9958 8 ай бұрын
Your work is incredible ! Thank you for sharing. I really like the dynamism and playfulness of the presentation. Which software did you use to make the video presentation ? Thank you in advance for your reply
@pesarlin
@pesarlin 8 ай бұрын
Thank you! I used only PowerPoint :)
@anywallsocket
@anywallsocket Жыл бұрын
How you choose validation data areas within training data areas?
@pesarlin
@pesarlin Жыл бұрын
We randomly sampled a fixed number of S2 cells in each training city.
Phillip Isola -- When and Why Does Contrastive Learing Work?
21:11
Learning with Limited and Imperfect Data
Рет қаралды 6 М.
Подсадим людей на ставки | ЖБ | 3 серия | Сериал 2024
20:00
ПАЦАНСКИЕ ИСТОРИИ
Рет қаралды 601 М.
Mom Hack for Cooking Solo with a Little One! 🍳👶
00:15
5-Minute Crafts HOUSE
Рет қаралды 22 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 16 МЛН
BAYGUYSTAN | 1 СЕРИЯ | bayGUYS
37:51
bayGUYS
Рет қаралды 1,6 МЛН
[ICCV 2021] Pixel-Perfect Structure-from-Motion with Featuremetric Refinement
10:48
Biggest Breakthroughs in Computer Science: 2023
10:59
Quanta Magazine
Рет қаралды 770 М.
Capsule Networks (CapsNets) - Tutorial
22:22
Aurélien Géron
Рет қаралды 188 М.
Lecture 12 | Visualizing and Understanding
1:15:48
Stanford University School of Engineering
Рет қаралды 255 М.
Contrastive Learning in PyTorch - Part 1: Introduction
14:21
DeepFindr
Рет қаралды 35 М.
Claire Heaney (Imperial) - 3rd December 2024
41:51
Applied and Computational Maths at Cardiff
Рет қаралды 132
1st place all challenges: Paul-Edouard Sarlin
23:28
Torsten Sattler
Рет қаралды 395
Подсадим людей на ставки | ЖБ | 3 серия | Сериал 2024
20:00
ПАЦАНСКИЕ ИСТОРИИ
Рет қаралды 601 М.