OpenAI CLIP Explained | Multi-modal ML

24,071 views

James Briggs

1 day ago

Comments
@ricardojung3849 2 years ago
Thanks for reporting, explaining and lastly opening up recent ML! I found clip to be very interesting since I always frowned at the lost potential of two different embeddings being arbitrary and methodically separate. This is huge!
@jamesbriggs 2 years ago
yes there will be plenty more on CLIP and other similar models very soon - some of the stuff I've built (and will demo) is awesome and nothing more than zero-shot CLIP, excited to share!
@mszak50 1 year ago
This was really excellent - some of the pieces are starting to make sense
@konichiwatanabi 1 year ago
Thank you so much for this great walkthrough! Looking forward to more
@DallanQuass 2 years ago
Great video! Looking forward to your next video diving more into using CLIP for zero-shot classification!
@jamesbriggs 2 years ago
Me too, it's fascinating. Thanks for watching!
@ismailashraq9697 2 years ago
This is amazing James. Thanks for the detailed explanation. I am excited for the future CLIP videos 🙂.
@jamesbriggs 2 years ago
Thanks Ashraq! As you know, I'm excited for them too
1 year ago
Thanks James, very good video about CLIP. Funny thing is that you display cos_sim twice, so the second plot is not dot_sim. And you struggled to find any difference between the two similarity matrices. LOL 🤣
@jamesbriggs 1 year ago
ah did I do that, oops 😅
@justinmiller7150 1 year ago
Great video. I think you may be plotting the same graph twice though (cos sim). In practice it is almost the same though it would seem.
@adrianarroyo9839 1 year ago
Nice video and explanation! I think on min 28:45 you plotted cos_sim instead of dot_sim!
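A note on the cos_sim / dot_sim comments above: cosine similarity is the dot product divided by the two vector norms, so the two scores coincide exactly when both embeddings are unit-normalised and otherwise differ by that scale factor. The sketch below illustrates this in plain Python. The three-dimensional vectors are made-up stand-ins rather than real CLIP embeddings, and the helper names are hypothetical, not taken from the video.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    """Euclidean (L2) length of a vector."""
    return math.sqrt(dot(a, a))

def cos_sim(a, b):
    """Cosine similarity: dot product scaled by both vector lengths."""
    return dot(a, b) / (norm(a) * norm(b))

def normalize(a):
    """Rescale a vector to unit length."""
    n = norm(a)
    return [x / n for x in a]

# Toy stand-ins for a text embedding and an image embedding (assumed values).
text_emb = [0.2, 0.9, 0.4]
image_emb = [0.1, 0.8, 0.6]

raw_dot = dot(text_emb, image_emb)                          # depends on vector lengths
cosine = cos_sim(text_emb, image_emb)                       # length-independent
unit_dot = dot(normalize(text_emb), normalize(image_emb))   # dot product after normalising

print("dot product:           ", raw_dot)
print("cosine similarity:     ", cosine)
print("dot of unit vectors:   ", unit_dot)
```

If the embeddings are normalised before comparison, the two similarity matrices are the same matrix; if they are not, the values differ, so a plot that looks identical in both cases is a hint that the same array was plotted twice.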
@AdeleHaghighatHoseiniA 1 year ago
Thank you for the good explanation. If we have 2 different embeddings, like texts and 3D images, can we use CLIP to predict images?
@abdirahmann 1 year ago
is there a hosted API for clip where you can provide your image data and get the vectors instead of having to host it yourself, kinda like how you give an input to `ada-002`?
@behnamplays 2 years ago
Excellent content! As a suggestion, can you please keep the images/diagrams a bit longer? They move pretty fast in the video, which means I'll have to rewind the video every now and then.
@jamesbriggs 2 years ago
Sure that’s great feedback, thanks!
@valentinfontanger4962 1 year ago
Excellent video
@debashisghosh3133 2 years ago
Really liked the content...thanks for sharing
@jamesbriggs 2 years ago
Thanks for watching!
@anantzen171 1 year ago
10:23 I believe CLIP is an abbreviation of Contrastive Language Image Pretraining
@Gabriel-ey5ky 2 years ago
Great video, really! I have just one thing to say: you should leave the images on screen longer. I had to pause the video multiple times to be able to understand them.
@jamesbriggs 2 years ago
Thanks Gabriel, I heard the same from another viewer - will do this going forwards :)
@PurpleRivar 1 year ago
Thanks. It is very informative. Can you please explain and teach us how to do fine-tuning on a custom dataset? Please.
@mvrdara 2 years ago
Excellent explanation! We could build a YouTube video search engine powered by CLIP. Perhaps you could iterate on the NLP YouTube search video you did?
@jamesbriggs 2 years ago
That's a great idea, but it might be difficult for YouTube videos where it is just someone talking, as the image embedding would just be something like "a person talking". Possibly it could be interesting to embed both the text + images with CLIP, and maybe even an averaged text+image embedding for parts of videos where both the speech + image are important. I will think about this more, it's a great idea so thank you!
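One way the averaged text+image embedding mentioned in the reply above could look is sketched here. This is an assumption-laden illustration, not the author's implementation: the vectors are made-up stand-ins for CLIP embeddings of a transcript snippet and a video frame, the function names are hypothetical, and normalising before and after averaging is a design choice made for this sketch so that neither modality dominates purely because of vector length.

```python
import math

def l2_normalize(vec):
    """Rescale a vector to unit Euclidean length."""
    length = math.sqrt(sum(x * x for x in vec))
    return [x / length for x in vec]

def average_embedding(text_emb, image_emb):
    """Combine two same-sized embeddings into one unit-length vector.

    Each input is normalised first, the element-wise mean is taken,
    and the result is normalised again (a choice made for this sketch).
    """
    if len(text_emb) != len(image_emb):
        raise ValueError("embeddings must have the same dimensionality")
    t = l2_normalize(text_emb)
    i = l2_normalize(image_emb)
    mean = [(a + b) / 2 for a, b in zip(t, i)]
    return l2_normalize(mean)

# Toy stand-ins (assumed values): speech transcript and video frame embeddings.
speech_emb = [0.3, 0.1, 0.9]
frame_emb = [0.5, 0.2, 0.4]

combined = average_embedding(speech_emb, frame_emb)
print("combined embedding:", combined)
```

Averaging like this only makes sense because CLIP places text and image embeddings in the same vector space; the combined vector could then be indexed for search alongside the per-modality vectors.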
@sharanbabu2001 2 years ago
Nice explanation!
@shaheerzaman620 2 years ago
fantastic stuff!
@dancinghoka 11 months ago
Thanks a lot!
@pyalgoGPT 2 years ago
Plz post on Deep Reinforcement Learning tutorials & projects with python !
@jamesbriggs 2 years ago
Eventually I’m sure I will, RL is very cool
@debayudhmitra9432 7 months ago
can you give the github code please
@mackenzieclarkson8322 7 months ago
Transitions are too flashy and triggering to my eyes. Good explainer however.
@davide0965 3 days ago
Too much talk and very few illustrations