How DINO learns to see the world - Paper Explained

  Рет қаралды 6,671

Boris Meinardus

Boris Meinardus

Күн бұрын

Пікірлер
@akshaymundra1052
@akshaymundra1052 8 ай бұрын
Loved your series on self-supervised learning. Are you also planning to cover DINOv2? I am particularly curios about the emergence property of the model -- how it is able to regress semantically consistent features for different parts of the objects (and not simple FG-BG separation as in DINOv1)!
@benmainbird
@benmainbird Жыл бұрын
Great video! Keep it up👍
@borismeinardus
@borismeinardus Жыл бұрын
Genuinely happy to hear you liked it, thanks! ☺️
@江楓漁火-e5u
@江楓漁火-e5u 5 ай бұрын
Hi, I'm a bit confused about the centering method you described in this video(3:25). In your video, you're adding the center to the online network's output, which is different from what I've seen in other implementations of DINO (kzbin.info/www/bejne/nmTMm2Z8aMiDf80si=BUj7iQMXKaEs0Nr1&t=1296). Most implementations subtract the center from the output. Could you please clarify if there's an error in the video or if this is a different approach to centering?
@yossefdiab7452
@yossefdiab7452 10 ай бұрын
great explaination
@borismeinardus
@borismeinardus 10 ай бұрын
thank you ☺️
@nasosgerontopoulos5267
@nasosgerontopoulos5267 Жыл бұрын
Very good content. Congrats 👍. Reading papers can be tough for many people, and such videos make it a lot easier to keep up with these state of the art advancements. As a fellow researcher, do you think investing time in self-supervised learning research is worth it right now? Considering that me and my team do not have access to such computational power as META and Google, I am not sure if we can keep up.
@borismeinardus
@borismeinardus Жыл бұрын
Hey, thanks! 😊 I think it is worth it! SSL is a broad field and SSL in the case of Multi-Modal Learning is very relevant. Yes, you will likely not be able to build the largest foundation models and go for scale, but you can definitely work on more nuanced research. E.g. Imagebind is a great example of a simple idea that does not require all the data and compute in the world. Btw. I also have a video on that paper :) kzbin.info/www/bejne/h4KtZHyIZcabg80si=VYxxIQPiyAXnlsw9
@pankajmaheshwari6033
@pankajmaheshwari6033 22 күн бұрын
In the training of dino I got same loss every time 10.09030 however I changed the Teacher Temperature hyperparameter below 0.06 which written in the paper.?can anyone suggest something beacuse I seen on the internet evryone have same problem with same exact loss value 10.09030 …please wrtite down any golbal solution!!!!
@carsongutierrez7072
@carsongutierrez7072 Жыл бұрын
Transformers~ ML bro~
@borismeinardus
@borismeinardus Жыл бұрын
👾
@menkiguo7805
@menkiguo7805 7 ай бұрын
it dose has the projection head though
Attention in transformers, visually explained | DL6
26:10
3Blue1Brown
Рет қаралды 2 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
Правильный подход к детям
00:18
Beatrise
Рет қаралды 11 МЛН
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
Fixing SimCLRs Main Problem - BYOL Paper Explained
12:18
Boris Meinardus
Рет қаралды 5 М.
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 406 М.
How might LLMs store facts | DL7
22:43
3Blue1Brown
Рет қаралды 903 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,2 МЛН
Can Contrastive Learning Work? -  SimCLR Explained
9:35
Boris Meinardus
Рет қаралды 11 М.
Vision Transformers Need Registers - Fixing a Bug in DINOv2?
9:20
AI Papers Academy
Рет қаралды 2,8 М.
DinoV2 AI Feature Detection and Feature Matching from Meta AI
18:34
Kevin Wood | Robotics & AI
Рет қаралды 2,7 М.
DINO: Self-Supervised Vision Transformers
21:12
Soroush Mehraban
Рет қаралды 3,5 М.
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН