DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

7,829 views

Ai Ape

1 day ago

Comments: 21
@adityapillai3091 2 months ago
Really good explanation. Would love to see you make more videos. You're very clear and the visual content you present is easily digestible
@aiape6954 1 month ago
Thank you! Started at a start-up and it has eaten my time lol
@adityapillai3091 1 month ago
@aiape6954 Start up grind ain’t no joke fr
@Erosis 1 year ago
Awesome explanation!
@pratyushk2693 4 months ago
Really easy to understand! Thanks!
@arseniypolyubin7076 11 months ago
Thanks a lot for this video!
@零鱼芃 10 months ago
Amazing work! I really want to know how to decide the cropping parameters based on different datasets. Is it completely based on experience?
@aiape6954 10 months ago
The research does not describe any strategy for tuning these parameters, so you have to assume it’s some mixture of intuition and trial and error. I would be interested in applying an evolutionary algorithm to find the best parameter set and seeing whether that pushes DINO performance.
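A minimal sketch of the evolutionary search idea from the reply above. Everything here is an assumption made for illustration: the parameter names, the ranges in `BOUNDS`, and especially `toy_fitness`, which is a placeholder with an arbitrary optimum. A real fitness function would need a short pretraining run followed by some evaluation (for example k-NN or a linear probe), which is expensive.

```python
import random

# Hypothetical search space for multi-crop parameters.
# Names and ranges are illustrative, not taken from the DINO or DINOv2 code.
BOUNDS = {
    "global_scale_min": (0.20, 0.60),  # lower bound of global-crop area fraction
    "local_scale_min": (0.02, 0.15),   # lower bound of local-crop area fraction
    "n_local_crops": (2, 12),          # number of local crops per image
}


def clamp(name, value):
    """Keep a value inside its range; the crop count is rounded to an int."""
    lo, hi = BOUNDS[name]
    value = max(lo, min(hi, value))
    return int(round(value)) if name == "n_local_crops" else value


def random_params(rng):
    return {name: clamp(name, rng.uniform(*BOUNDS[name])) for name in BOUNDS}


def mutate(params, rng, sigma=0.1):
    """Gaussian perturbation scaled to each parameter's range."""
    child = {}
    for name, value in params.items():
        lo, hi = BOUNDS[name]
        child[name] = clamp(name, value + rng.gauss(0.0, sigma * (hi - lo)))
    return child


def evolve(fitness, generations=20, population=8, elite=2, seed=0):
    """Elitist evolutionary search: keep the best few, mutate them to refill."""
    rng = random.Random(seed)
    pop = [random_params(rng) for _ in range(population)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:elite]
        children = [mutate(rng.choice(parents), rng)
                    for _ in range(population - elite)]
        pop = parents + children
    return max(pop, key=fitness)


def toy_fitness(params):
    # Placeholder score with an arbitrary optimum, used only so the
    # sketch runs end to end. Replace with a real train-and-evaluate step.
    return -(
        (params["global_scale_min"] - 0.4) ** 2
        + (params["local_scale_min"] - 0.05) ** 2
        + ((params["n_local_crops"] - 8) / 10) ** 2
    )


best = evolve(toy_fitness)
print(best)
```

Because the elite survive each generation unchanged, the best score never gets worse over time. The cost is dominated by the fitness call, so small populations and few generations are the realistic setting.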
@leeyuguang4424 11 months ago
How is it different from DINO itself? I wish there were more explanation.
@elahe4737 1 year ago
Thank you so much. It was clear and interesting. I have a question please, is it possible to modify the attention maps in this model?
@aiape6954 11 months ago
Check out this repo! I use it all the time. github.com/ShirAmir/dino-vit-features/tree/main
@VLM234 8 months ago
That's a great explanation. Are you planning to make a video on the Florence-2 model? I would love to see it applied to a livestock use case.
@mortezasjah6168 4 months ago
Thank you for wrapping up the code and explanation. Does your code support a multi-node implementation? And is there any difference between your notebook and the DINOv2 code?
@Kofi-qu9zc 4 months ago
Hi, great video. I have a tangential question. I am trying to use the base pretrained DINOv2 model from Hugging Face on the Broad Institute's BBBC021 dataset of MCF7 breast cancer cells, and I'm finding that the CLS embeddings, when clustered, don't align with the labels (MoAs) in the dataset. Given your experience with DINO, do you think this is due to the cropping strategy used in the pretrained model, and would I have to retrain a bare-bones DINOv2 model on millions of microscopy images to get classification working correctly? Thanks for any help!
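One way to put a number on "the embeddings don't align with the labels" is a leave-one-out k-NN check. The sketch below is an assumption about how such a check could look, not something from the video or the DINOv2 code. The vectors and the label names `moa_a` and `moa_b` are toy stand-ins. Real inputs would be one CLS embedding per image.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def loo_knn_accuracy(embeddings, labels, k=1):
    """Leave-one-out k-NN accuracy using cosine similarity.

    Each sample is classified by a majority vote of its k most similar
    neighbours, excluding itself.
    """
    correct = 0
    for i, query in enumerate(embeddings):
        sims = [
            (cosine(query, other), labels[j])
            for j, other in enumerate(embeddings)
            if j != i
        ]
        sims.sort(key=lambda pair: pair[0], reverse=True)
        votes = Counter(label for _, label in sims[:k])
        if votes.most_common(1)[0][0] == labels[i]:
            correct += 1
    return correct / len(embeddings)


# Toy stand-in vectors; hypothetical labels for illustration only.
embeddings = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
labels = ["moa_a", "moa_a", "moa_b", "moa_b"]
print(loo_knn_accuracy(embeddings, labels))
```

A score near chance level would suggest the pretrained features carry little label information for this data. A score well above chance alongside poor clustering would point at the clustering step instead. Neither outcome settles whether retraining is needed.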
@DevelopmentTeam-b8x 1 year ago
Well explained!!
@vizlifestudios 2 months ago
thank you!
@WildWonders7-u9z 7 months ago
Hello, I have a paid project on DINO, iBOT, and DINOv2. Will you help?
@aesaerthherbo3783 1 year ago
Amazing explanation, but I think you are just explaining DINO instead of DINOv2.
@aiape6954 1 year ago
Everything in this video applies to both. The process was optimized for DINOv2 but the structure remained the same.
@rickli3746 9 months ago
I wonder if you think DINOv2 could be applied to CNNs?
@aiape6954 9 months ago
My intuition is that it would work but not as well as the transformers. Transformers are slow and computationally expensive but they hold information in a way that CNNs cannot. Probably better off distilling down to a CNN from a transformer.
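For readers wondering what distilling into a CNN would involve, here is a rough sketch of a DINO-style distillation loss in plain Python. It is written under assumptions. The function names are made up for this example, the temperatures are the commonly cited DINO values used here as illustrative defaults, and `center` stands in for the running mean of teacher outputs. The loss only sees output vectors, so nothing in it depends on whether the student is a transformer or a CNN. How well a CNN student would actually train is not tested here.

```python
import math


def log_softmax(logits, temperature):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(student_logits, teacher_logits, center,
                      student_temp=0.1, teacher_temp=0.04):
    """Cross-entropy between the teacher and student output distributions.

    The teacher output is centered and then sharpened with a low
    temperature. In training, only the student would receive gradients.
    """
    centered = [t - c for t, c in zip(teacher_logits, center)]
    targets = [math.exp(v) for v in log_softmax(centered, teacher_temp)]
    student_log_probs = log_softmax(student_logits, student_temp)
    return -sum(p * lp for p, lp in zip(targets, student_log_probs))


# Toy example: the student agrees with the teacher in one case
# and disagrees in the other.
teacher = [2.0, 0.0, 0.0]
center = [0.0, 0.0, 0.0]
loss_agree = distillation_loss([2.0, 0.0, 0.0], teacher, center)
loss_disagree = distillation_loss([0.0, 2.0, 0.0], teacher, center)
print(loss_agree < loss_disagree)
```

The loss is small when the student puts its probability mass where the teacher does and large when it does not, which is the signal that would pull a student of any architecture toward the teacher's outputs.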