Thanks for the great video. Shouldn't we use `residue = x.clone()` at 1:22:30 for the residual connection? Otherwise the residue variable will get updated as well.
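(For anyone else wondering: whether `clone()` is needed depends on whether `x` is later mutated in place. If the block only reassigns `x`, e.g. `x = self.norm(x)`, the plain `residue = x` is safe, since reassignment rebinds the name rather than mutating the tensor. A minimal sketch of the aliasing issue, assuming in-place ops:)

```python
import torch

x = torch.ones(3)
residue = x          # aliasing: residue and x share the same storage
x.add_(1)            # in-place update mutates the shared tensor
print(residue)       # tensor([2., 2., 2.]) -- residue changed too

x = torch.ones(3)
residue = x.clone()  # independent copy with its own storage
x.add_(1)            # in-place update no longer touches residue
print(residue)       # tensor([1., 1., 1.])
```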
@RishabhMishra-h5g · 1 day ago
For the first minibatch in off-policy learning, the ratio of the offline and online log probs would be 1, right? It's only after the first minibatch pass that the online policy would start producing different log probs for the action tokens.
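(A tiny sketch of that importance ratio with made-up numbers, assuming the usual PPO-style setup where the per-token ratio is exp(online log-prob minus offline log-prob):)

```python
import torch

# Made-up log-probs of the sampled action tokens under the frozen (offline) policy.
offline_logprobs = torch.tensor([-1.2, -0.7, -2.3])
# Before any gradient update, the online policy is identical to the snapshot.
online_logprobs = offline_logprobs.clone()

ratio = torch.exp(online_logprobs - offline_logprobs)
print(ratio)  # tensor([1., 1., 1.]) -- first minibatch: no divergence yet

# After an optimizer step the online policy drifts, the ratio leaves 1,
# and PPO-style clipping starts to have an effect.
online_logprobs = torch.tensor([-1.0, -0.9, -2.3])
print(torch.exp(online_logprobs - offline_logprobs))  # ~tensor([1.2214, 0.8187, 1.0000])
```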
@xugefu · 1 day ago
Thanks!
@praveensoni1119 · 1 day ago
"This explaination is all I need" to crack my DS interviews. I really liked the smooth buildup of Transformer concepts in this video. Thanks a lot man!!
@akaskmsskssk6927 · 1 day ago
One random afternoon last year I decided to watch the whole video, and now I have my own 1B-parameter LLM built with your code. Thank you so much. Don't ever stop inspiring new AI programmers! Greetings from the Philippines.
@chatpatey7282 · 2 days ago
Sir, I understood most of the content, but I'm struggling to grasp how the decoder block of the UNet was designed. I tried to understand it and even attempted to write it on my own, but I couldn't manage. Could you please guide me on where I should focus, or share any resources that can help me understand this better?
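(In case it helps, here is a minimal sketch of a typical UNet decoder block; the class name, channel sizes, and layer choices are illustrative, not the exact code from the video. The idea: upsample the coarse features, concatenate the encoder's skip connection at the same resolution, then refine with convolutions.)

```python
import torch
import torch.nn as nn

class UNetDecoderBlock(nn.Module):
    """One decoder step: upsample, fuse the encoder skip connection, refine."""
    def __init__(self, in_channels: int, skip_channels: int, out_channels: int):
        super().__init__()
        # Transposed conv doubles the spatial resolution and halves the channels.
        self.upsample = nn.ConvTranspose2d(in_channels, in_channels // 2,
                                           kernel_size=2, stride=2)
        self.conv = nn.Sequential(
            nn.Conv2d(in_channels // 2 + skip_channels, out_channels,
                      kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor, skip: torch.Tensor) -> torch.Tensor:
        x = self.upsample(x)             # e.g. 16x16 -> 32x32
        x = torch.cat([x, skip], dim=1)  # fuse encoder features at the same scale
        return self.conv(x)

# Usage with illustrative shapes:
block = UNetDecoderBlock(in_channels=128, skip_channels=64, out_channels=64)
x = torch.randn(1, 128, 16, 16)    # coarse (bottleneck) features
skip = torch.randn(1, 64, 32, 32)  # matching encoder features
out = block(x, skip)               # shape: (1, 64, 32, 32)
```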
@Levi-AckermanYT · 3 days ago
Excellent video! Could you provide the notes? I would greatly appreciate it. ❤
@Levi-AckermanYT · 3 days ago
I can't find the slides in the GitHub repo.
@nursami7842 · 3 days ago
Next, please explain the Vision Transformer 🙏
@waniubaid7718 · 3 days ago
Please make a Video-LLaMA explanation in the same way ❤❤❤
@waniubaid7718 · 3 days ago
❤
@vanerk_ · 3 days ago
Great, as always, thank you sir!
@omerayklc900 · 4 days ago
Hello Umar, I'm always amazed by your videos. Can you make a video about how to run Hugging Face models on Google's TPUs? It's hard to understand torch_xla, and the documentation is not great. When working with billion-parameter models like LLaMA 7B etc., it would be easier to run them on Colab's TPUs. Also, Google has a program called the TPU Research Cloud (TRC), which gives us researchers a great opportunity to train or fine-tune these billion-parameter models. It would be great if there were a tutorial on how to utilize the TPUs. Have a great day!
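(Not a substitute for a full video, but here is a minimal sketch of the basic torch_xla flow, assuming a TPU runtime with torch_xla installed; "gpt2" is just a small example model:)

```python
# Run a Hugging Face model on a TPU core via torch_xla.
import torch_xla.core.xla_model as xm
from transformers import AutoModelForCausalLM, AutoTokenizer

device = xm.xla_device()  # the XLA/TPU device visible to this process
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").to(device)

inputs = tokenizer("Hello, TPU!", return_tensors="pt").to(device)
outputs = model(**inputs)  # ops are traced lazily into an XLA graph

xm.mark_step()  # forces compilation + execution of the pending graph
print(outputs.logits.shape)
```

(During training, `xm.optimizer_step(optimizer)` is the usual replacement for `optimizer.step()`, since it also handles the XLA graph sync.)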
@Mohitdadhich-fn8ix · 4 days ago
What a great contribution, Mr. Umar… Deep respect for your work ❤
5 days ago
thank you so much, great work
@hamedmc7938 · 5 days ago
Bro, build a GPT backend as a training course.
@xian623 · 5 days ago
worth every second
@binfos7434 · 5 days ago
Completed it today. One word: `Amazing`. Looking forward to the Flash Attention with Triton video.
@Mohammed.1471 · 5 days ago
Thank you so much... Best explanation on YT I could find on this topic... 🙌
@ShiouTianHsu · 5 days ago
Thanks!
@guneeshvats46 · 6 days ago
Best explanation of transformers on YouTube!!
@tiagojc · 6 days ago
Thanks!
@tiagojc · 6 days ago
Great video, Umar, thank you so much. Any chance you'll release a fine-tuning video showing how to fit this model to my own dataset?
@Peaceful-er4vf · 6 days ago
3:39:42 Hahaha, so cute
@赵品学 · 7 days ago
This is ABSOLUTELY AMAZING!
@jamesx708 · 7 days ago
The best video for learning RLHF.
@millenniumbismay382 · 7 days ago
Thank you! It has been a lifesaver :) It is definitely the best explanation of such important concepts, by a long way! Thank you so much.
@Sathyam_a31 · 7 days ago
The best video explanation I have ever gotten on Mistral!! Thank you so much for your efforts.
@magnetpest2k7 · 7 days ago
Thanks!
@tunicorn3551 · 8 days ago
The AI Bible
@GvK-wb2nc · 8 days ago
Great!!! Thanks so much!!!
@arminbiglari1043 · 9 days ago
You are my lifesaver, thank you very much!
@Allen-TAN · 9 days ago
Thanks for the masterpiece. Can you make a video about the recently famous model Recurrent Memory Transformers (RMT), and how to make this new model compatible with transformers in HF?
@Army-qs5fu · 10 days ago
How can I train a transformer model from scratch for PDF summarization... Is it the same procedure???? Can you please reply? I'm a student who knows only the fundamentals and wishes to learn more.
@parichehresmailian · 10 days ago
How can I start to deep-dive into RAG? I studied software engineering at university for my BSc, and these days I'm doing an MSc in e-commerce, which is completely intertwined with AI... and I really want to move on to RAG 🥺
@fortuneolawale9113 · 11 days ago
thanks
@ruijian5693 · 11 days ago
If you only have time to watch one video about flash attention, this one is the one.
@janigiovanni6075 · 11 days ago
Actually watching it for the second time now, because there is so much valuable information in here :D
@janigiovanni6075 · 11 days ago
Great video, thank you very much for this!
@mlworks · 11 days ago
Brilliant video on transformers with key math explanations.
@AnushkaMehta-c6b · 11 days ago
thanks
@piz-qg9xp · 11 days ago
So it's the cat that is behind the scenes kzbin.info/www/bejne/f4SxlYSZhc2mqtU. Thanks, Dr. Kitty
@maitreyimandal8910 · 12 days ago
Such great content!
@Ask0ldd · 12 days ago
Thank you very much for all your hard work. Your channel is a goldmine. 👑
@cbr250-p6v · 13 days ago
Please bring a "Vision Transformer architecture explained" video
@umarjamilai · 13 days ago
It’s explained in the first hour of my “Coding a Vision Language Model” video
@yanghelena · 14 days ago
Thank you for your selfless sharing and hard work! This video helps me a lot!
@AD-zj7ck · 15 days ago
Thanks for the amazing video. Can you make a video explaining and proving the universal approximation theorem?
@Cryptic3.0 · 15 days ago
This is gold!
@jaewanpark2570 · 16 days ago
This absolutely is gold. Actually, it's closer to diamond than gold. My favorite parts are 2:44:50 and 2:47:50, when the cat also feels the content is wonderful.
@harshalhirpara4589 · 16 days ago
Thank you Umar, your video made me connect all the dots!