After 2 years I made a new video explaining diffusion models from a different angle. I find this approach much better to understand: kzbin.info/www/bejne/eGXSeX2meq91d6M
@Тима-щ2ю3 ай бұрын
Are diffusion models really so hard to understand?
@outliier3 ай бұрын
@@Тима-щ2ю you tell me
@psycheguy5032 ай бұрын
after spending 2 hours taking notes and understanding the 30 mins video, and scrolled down to find this x))
@ulamss52 жыл бұрын
Explaining the notations is a game changer... more educational content channels should do this.
@akashprajapathi60566 ай бұрын
Understanding math easier than its notation used 😂😂😂😂😂
@AdmMusicc10 ай бұрын
This was the best ML paper review I have ever seen. You stopped making videos but I would really love to see you go through more of this for more research in the field man! Hatsoff to you.
@Chadpritai3 ай бұрын
music and ML >>>
@-long- Жыл бұрын
For those who are confused about the recursive expansion at 13:13 (like I did), it's "a property of Gaussian distributions, where the variance of the sum of two independent Gaussian variables is the sum of their variances. "
@herrbonk3635 Жыл бұрын
I'm confused about the notation q(Xt|Xt-1) and p(Xt-1|Xt). Never seen the result of a function presented as part of the argument before. Not even sure I understood which is which from his prose.
@yogeshsingular11 ай бұрын
Seems to follow from uncorrelated noise variables at different steps, using the formula var(X1+X2)=var(X1)+var(X2)+2cov(X1,X2) where cov(X1,X2)=0. We don't seem to need to use normality here
@AICoffeeBreak2 жыл бұрын
This is incredible! Did not see a video with the math explanations of diffusion models yet. And you animated it in manim! Just great. 😎
@outliier2 жыл бұрын
thank you so much! actually it's not even animated with manim. It's all done in Premiere Pro haha. But I guess that I'll definitely do those things in manim in future videos....
@leif1075 Жыл бұрын
@@outliier Thanks for sharing bit how do ppl.not get bored and frustrated during the math lart..even if you are a math genius..and if you don't think of the smweird step of taking out the first term of the sum..can't you still reach the same goal? So why do thst at all?
@NicholasRenotte2 жыл бұрын
Wow, this is absolutely brilliant. Massive kudos for making quite the complex topic significantly more digestible!
@sauvage_pikachu2 жыл бұрын
Hey, thanks very much for making this wonderful video! I just want to appreciate the fact that all notations are clearly explained before going into the math part. That helps a lot! Great work!
@christiandeverall56612 жыл бұрын
I've watched a bunch of videos trying to understand Diffusion (Ari Seff, Assembly AI etc) and this one taught me the most by far. Please keep making videos!
@Nixo_0112 күн бұрын
I am so glad I found this video. I have a more simplified understanding of diffusion models now. Please keep posting such informative and easy to understand content.
@brianpulfer41592 жыл бұрын
This is the first ever video of you that I get to see. Congrats, truly amazing. I believe you are among the first people on YT to dig into the math equations of ML papers like this, and I believe it's truly valuable. Keep it up!
@felixvgs.98402 жыл бұрын
What an amazing video!! I looked everywhere for a comprehensible video about Diffusion Models and yours was simply the best… Please keep up the effort and the great content :)
@vladi21k2 жыл бұрын
After going through 4 different YT videos, yours was the only one that was clear enough for me to understand. Thank you very much!
@ravindrabisram1372 жыл бұрын
This is the first source I was able to find that explained the math behind diffusion models in a comprehensible way instead of glossing over it. Thanks a lot, you have earned my like and subscribe with just this video alone!
@SaulRamirez-x6e Жыл бұрын
This video is amazing. I think the format of your video was incredible, you went over the literature and told us how we got there, you went over the high-level explanation then got into the nitty-gritty detail and then just in case we miss something you gave an amazing recap. This is how all videos on deep learning should be. Especially as we're getting into more Niche topics.
@codingblaze4611 Жыл бұрын
Nicely explained. Most of the people leave these derivatives thinking it would make the tutorial boring but without these derivativation we don't understand how was the methodology evolved. Great job reasearching and explaining.
@checkout83522 жыл бұрын
Superb work. 1. Gone through the history of diffusion of models by explaining all the previous papers. 2. Giving an intuition of whole idea. 3. Explaining math behind it. 4. Also incorporating future prospects
@ryanl1988 Жыл бұрын
This is my first time leaving a comments under a ML tutorial YT channel. The explanation is amazing intuitive, thanks for sharing your knowledge and creating this video!
@outliier Жыл бұрын
So nice to hear that thank you!
@cutethanks11 ай бұрын
The most clear explanation I’ve seen on YT. Much more clear than that from MIT lectures lol Many thanks
@InturnetHaetMachine2 жыл бұрын
Thank you so much for delving deep into the math. I'm an engineer (not software) and self-learning AI. The papers are unfortunately not written in the most explainable way, and even though I've taken high level math courses for my degree, the notation and terminology in the papers make it pretty inaccessible and frustrating to follow. Thanks for going through this paper, I hope you continue to make more videos.
@javiersolisgarcia Жыл бұрын
I started reading articles and looking for learning content on diffusion modelling and the notation seemed a bit difficult. However, I am only half way through this video and I can assure you that this video is a must watch. Very clear explanation, I will recommend it to anyone interested in exploring this field, congratulations on your work!
@anujshah79492 жыл бұрын
Absolute king! Your work is such an important part of this community
@akshayshrivastava972 жыл бұрын
Very well explained! You made sure to include a lot of important points others either omit or simply skim over. Thank you very much.
@aspiringmango1929 Жыл бұрын
16:24 I don't understand how you rewrote the KL divergence as the log ratio. Specifically, I don't understand how D_KL (q || p) = log(q / p). This is different from the definition of the KL divergence, which would suggest that D_KL (q || p) = integral q * log(q / p). Could someone please explain why D_KL (q || p) = log(q / p) in this case? Thank you! This was a fantastic video and your efforts are greatly appreciated!
@lukasaichberger3081 Жыл бұрын
You are right! To be precise, he should be talking about the expected value of the log ratio.
@ruofengmusictech Жыл бұрын
See the original paper arxiv.org/pdf/2006.11239.pdf page 2. The objective is to maximum the "expected" negative log likelihood. Since the expectation is calculated as integral over x_1...T rather than x_0, it'll be 1. You can think that everything the video talks about happen inside the E_q[ ... ] bracket
@krishnadave94292 ай бұрын
hey did you understand why was it done! i have the same question.Could you please share it if found?
@StephenRayner Жыл бұрын
Wow……. Haven’t read math in a while, this was explained excellently. I have a masters degree in physics but don’t do much math anymore since my degree in 2017. I really like how much detail you went into with the derivations and the pausing to ground what we are doing with some intuition. Well done man 🎉
@parhamnooranbakht905318 күн бұрын
One of the best explanations out there. Great work.
@alexanderstark322911 ай бұрын
Best explanation I've seen so far. Though notation in math derivation section is still poorly explained... I understand every step in derivation, but don't always understand what each term logically means.
@outliier11 ай бұрын
Can you give some examples? :3
@TheVarun64 ай бұрын
This is an amazing video. I've gone through many videos to get the intuition behind the diffusion model, but nothing never helped. You did a great job simplifying the entire process.
@NellyParsley2 жыл бұрын
Man, this is incredible. When I saw these equations in the paper and other sources I was like "no way I am gonna understand that".. but with this video it all makes sense. Brilliantly done, thank you so much for your work. Instant subscribe and I am going to check other content on your channel :D
@AbrarMajeedi Жыл бұрын
Easily the best video on Diffusion models. Great work!
@nikitadeshpande66432 жыл бұрын
You are the Outlier we cannot miss! Real gem. Thanks for the explanation man!
@BritskNguyen3 ай бұрын
I really like your video because instead of using a bloated set of terminologies like conditional, marginal, prior, posterior blah blah, u just nailed it down to "function". You're like the p function that denoises these condiffusion jargons :))
@outliier3 ай бұрын
@@BritskNguyen Thank you! Take a look at my latest video. I think this approach to diffusion models is even better
@aalonsobizzi7599Ай бұрын
So, so clear, thanks! Exactly what I was looking for to study for my deep generative models project
@frapbrab6642 жыл бұрын
You're the GOAT man, very great summary of diffusion
@entropica Жыл бұрын
Great expalanation, but at (16:27) there's taking an integral over dq missing when rewriting KL(q || p) ! Same at (16:57)
@TheSeamau52 жыл бұрын
Thank you so much. I actually just recently worked out a lot of this math a couple weeks ago for a model I'm building and this video would've saved me so much time. Very clear. Thank you 🙏
@crackwitz2 жыл бұрын
Would have upvoted several times. Yours is the first video I found that actually goes into the math. Others just slap it onto the screen as fact, dazzling and confusing the viewer.
@kumaranragunathan7602 Жыл бұрын
Explaining the mathematical reasoning and formulas behind the model in such detailed fashion is amazing , keep up your good work
@seriousbusiness2293 Жыл бұрын
This is one of the rare videos i wanted to like twice. Learning this in uni but im struggeling so hard, i think i am a mathy person but all those unexplained choices and variables, calculation stepps without knowing why... it made it so hard to more deeply understand the material. But your video is just perfect, referencing the sam papers but now its all more childs play and fun to stop and follow. Its almost sad you only have so few videos but at least the quality is through the roof.
@TheKkunte2 жыл бұрын
This is the best explanation I have found so far. Thank you.
@markpayton3895 Жыл бұрын
Best video on diffusion model right now because of the math derivation of everything. Thank you!
@kartikeyabhardwaj39192 жыл бұрын
this is by far the best video on diffusion models that explains the math clearly, great job!
@inakitodc6816 Жыл бұрын
just the best expanation by far I have seen in days of searching. congrats
@cleverclover7 Жыл бұрын
i just watched like 5 of these videos on this subject, specifically the math. This was the best one by far. You should teach.
@JBoy340a2 жыл бұрын
Wow! Amazing job explaining diffusion models and why they use the math they do.
@jiejie_vegetable Жыл бұрын
This is the best video I have ever watched that can explain diffusion models so clear even to someone like me :P
@riazzai9250 Жыл бұрын
The explaination about loss function, especially the part of KL divergence, is amazing! I love your video!
@sedi_rockstar74812 жыл бұрын
Just want to say thank you. I believe this is one of the most high-quality videos I have ever seen given on diffusion models! Keep it going. I have subscribed!
@outliier2 жыл бұрын
thank you so much!
@thecheekychinaman6713 Жыл бұрын
Most videos do not going into the mathematics, or are explained in a dry slideshow manner. This is really something else.
@kateyurkova6384 Жыл бұрын
Brilliant approach of lining up equations into a story, great work, thanks!
@DarshanShah8385 ай бұрын
Kudos to you. Hats off to explain such a topic with so much ease even though the math equations looks scary at first. You made it real easy. Great work
@bhavyaruparelia74317 ай бұрын
Your explanations are simply great! I do recommend you to return back to KZbin covering latest papers in this field :)
@caiocj1 Жыл бұрын
Thanks for the video. Can someone explain why we can do the KL divergence step at 19:55? To me you haven't taken the integral of the expression across all samples and there's no q(x_T|x_0) in front of the first term for example, so why can we do this?
@PythonProdigy9 Жыл бұрын
I just watched your video on diffusion models, and I am incredibly impressed with the depth of information you provided. Your explanation was clear, concise, and immensely helpful. Thank you for sharing your knowledge on this topic. I learned a lot from your video and I truly appreciate your efforts in creating such valuable content.
@JasimUsmani2 жыл бұрын
Thank you for making such a high quality video explaining the math. Often, other channels do not emphasize on the math and this video is perfectly putting light on how exactly the math fits in diffusion models. Thank you for your amazing work. Please, make more such content!
@oriyonay88252 жыл бұрын
this video is *by far* the best video on diffusion models i've seen on youtube. this was very pleasant to watch and you made everything really clear. brilliant!! i subscribed and turned on notifications :) have an amazing day :)
@HearinCantMeow9 ай бұрын
what a wonderful and thoughtful way to deliver the whole langscape of the diffusion model! Nice video! 👍
@azmihaider10 ай бұрын
The math derivation part was amazing. really good. If I could have just one note, I would've wished you spoke a bit slower, just a tiny bit. But truly great work, much appreciated and waiting for more content.
@uslessfella10 ай бұрын
19:57 how can it be KL divergence? there has to be a term outside of log for it to be KL divergence?? can you explain this?
@andyfeng62 жыл бұрын
Thank u for the detailed explaination, looking forward for your pytorch implementation video!
@xiaohaolin6464 Жыл бұрын
Excellent video! Very clear derivation, and good animation. You are a good teacher with loads of patience, and guided us step by step!
@itsnotthattough7588 Жыл бұрын
Thanks for the simple but detailed explanation! I wouldn't be able to understand the topic without your video.
@icejust9195 Жыл бұрын
I really like your math part! Please keep going amazing work!
@dvirhanum9530 Жыл бұрын
When the math part started I went to continue watching at the toilet
@yulongtian77834 ай бұрын
you are not the only one
@hieuaovan71017 ай бұрын
love to see more good explaination for other model, your explaination is soo good
@autkarsh883010 ай бұрын
Thanks, the video was really helpful, it gave me such a great time in understanding diffusion models, kudos and keep on making such quality content!
@curiousseeker3784 Жыл бұрын
I remember coming at this video a month ago to understand diffusion models, getting overwhelmed and lost by te scary tons of maths formulae, Now after reviewing the necesary math concept, Realized how beautifully you've put it all together....Amazing
@curiousseeker3784 Жыл бұрын
OMG this is insanely complex thing i've ever learned yet in ML/AI and tho I see I still gotta spend some time in it but kuddos u've done a super amazing job!
@outliier Жыл бұрын
Thank you so much, super happy the video helped you!!!
@curiousseeker3784 Жыл бұрын
@@outliier brother there's a slight confusion. In Algo#2 , we already sampled a random noise x_t , and remove a predicted noise to obtain x_t-1, then why do we add another random noise z and what is even that z for ?
@outliier Жыл бұрын
@@curiousseeker3784 when you have x_t and you predict the noise you get an approximation for x0. This however doesn’t look so good, thats why you add noise again until x_t-1 and then repeat the process. So you have an iterative sampling process.
@SteveSperandeo2 жыл бұрын
Excellent presentation. Great balance between depth and succinctness. Thank you!
@timforcade10292 жыл бұрын
Many thanks for this. I'm an artist with very limited math skills and though I can't say I understood the whole, your teaching gave me a solid basis and an understanding of this I've been wanting. You have another fan.
@yyq902 жыл бұрын
So satisfied to know that we just need to predict the noise!!! After so many formulars...🙏🙏🙏
@NinadDaithankar57 ай бұрын
Amazing video; thanks a lot for going in depth on the math with simplified animations!
@yogeshsingular11 ай бұрын
Really great video. We need more videos like this. Helped me understand cryptic papers which can be very frustrating...
@rma15637 ай бұрын
Appreciate the effort you put into this. You definitely can teach. If only I have a brain to understand math... still got some bits here and there. Thanks
@srinathkumar1452 Жыл бұрын
Wow this is such a fantastic explanation. I love how you describe the intuitions behind the authors' mathematical choices.
@wdabrilvi Жыл бұрын
I was just using those tools to generate images but due to this video i got a lot more interested in understanding how they work. I hope you keep doing this kind of videos.
@sanjaybhandari24872 жыл бұрын
Hopping for more great contents .
@RezaSoumi Жыл бұрын
Thank you. Your explanation has been profoundly enlightening and exceptionally lucid, providing me with a comprehensive understanding.
@PakkaponPhongtawee2 жыл бұрын
Amazing! The visualization is great and easy to follow.
@bayesianmonk Жыл бұрын
You have a superpower of explaining math. Really enjoyed it.
@williamdevena8565 Жыл бұрын
Great Video! Hands down the best explanation of DDPM’s math
@chiscoduran95172 жыл бұрын
Just the video that I needed, thanks so much!!!
@elisawarner79422 жыл бұрын
Thank you so much for making this video! It was very clear and I really appreciate how you walked through the math and the reasoning for how they went from the initial loss to writing it in terms of predicting the noise. Everything was well made. I look forward to watching your other videos!
@pengxiaohan33712 жыл бұрын
Nice explaination in Math. Rarely see a such detailed diffusion model explaination video. Good job and thanks
@debajyotisg2 жыл бұрын
@19:57 How do we get the KL divergence terms? Isn't there supposed to be a expectation integral/sum somewhere?
@afrozenator Жыл бұрын
A few comments below, Outlier posted: ``` Im going to cite a friend here: "During training we sample a batch of data from a distribution with probability p. So the global function to be optimized is a summation over the dataset p*log(p/q), which is an expectation of log(p/q) by definition." ```
@garyfeng9528 Жыл бұрын
you should create more of this videos...they are just so good... It must been time consuming. Maybe consider make some smaller topics or split one big topic into more videos. AMAZING JOB. I believe a high school can get the main points from this! GJ!
@outliier Жыл бұрын
Thank you so much! The next video is on the way!
@Steveineiter2 жыл бұрын
One of the best explanations here on KZbin - thank you very much! 🥳
@kosmar3714 Жыл бұрын
Thanks for the video, very neat explanation. May I suggest, when you explain the forward process the second equation in 13:02 is q(x_{t}|x_{t-2}) ... up to q(x_{t}|x_{0}) for the final formula. Also the derivation of the chain rule is not entirely obvious, it took me some time to find the answer. The answer is that the variance of summation of the two normal gaussians is equal to the sum of variances. This is how you get rid of the square root and the sum of variances give the expected result of 1 - a_{t}a_{t-1}...a_{1}.
@mousamustafa10428 ай бұрын
U really liked that you showed the derivation in an understandable way
@glatteraal26782 жыл бұрын
Hey, really awesome question! subscribed! But I have a problem I can't wrap my head around: at 13:10 when we go from x_t-1 to x_t-2: I understand the left hand side of the equation but can someone explain me why the right hand side is sqrt(1 - alpha_t * alpha_t-1) * epsilon? If you just substitute x_t-1 in the equation above I thought we would end up with : (sqrt(1 - alpha_t) * epsilon + sqrt(1 - alpha_t-1) * epsilon). I understand that its supposed to "merge" the variance of two gaussian distributions but I just dont understand how you end up with the right hand side, if anyone could explain this to me I would be so thankful!!!
@marcella.astrid2 жыл бұрын
In this part, I also tried to derive the formula but can't get it too. My derivation of the right hand side (the epsilon part) ended up to (sqrt(alpha_t - alpha_t alpha_{t-1} + sqrt(1-alpha_t)) epsilon Unless sqrt(a)+sqrt(b) = sqrt(a+b) (which is not true), I also can't get the sqrt(1-alpha_t alpha_{t-1}). I wonder what I am missing
@glatteraal26782 жыл бұрын
@@marcella.astrid this is the first time for me having a discussion over math on youtube. I will try to look into it. I found some rule empirically that shows that this acctualy is true, if you merge two gaussians, the second expectation is just sampled from the first gaussian with a certain factor, then the factor goes into the veriance of the new distribution. I actually made a jupyter notebook to try it with all kind of values I could send it to you if you want, but I still did not found the underlying rule that explains it. asked a lot of math students in real life but either they are too busy or dont know this rule too.
@samuelbeaussant30972 жыл бұрын
@@glatteraal2678 Is this derivation from the original paper ? Cause it seems odd if not wrong
@trellas36892 жыл бұрын
I have explained it in another comment. The thing is that the epsilons are different normal distributions and cannot be threated as the same. You have to use some propertied of the normal distribution to end up with the formula.
@xzzit Жыл бұрын
@@marcella.astrid Recall that when we merge two Gaussians with different variance, e.g. sigma1^2 and sigma2^2, the variance of new distribution is (sigma1^2+sigma2^2). In this example, the right hand side equals to sqrt(alpha_t - alpha_t alpha_{t-1}) epsilon + sqrt(1-alpha_t) epsilon, which are two Gasussians merged together. The new variance is therefore, alpha_t - alpha_t alpha_{t-1} + 1 - alpha_t = 1 - alpha_t alpha_{t-1}
@spiritual-Aatma Жыл бұрын
Video is really well made. You did well to summarize to keep things simple and explanatory.
@MK-yj7pn Жыл бұрын
Fantastic video, man. Explained the stuff really really well. Thanks.
@Magnify.2 жыл бұрын
Great video, thank you for this!
@yuhonglin88982 жыл бұрын
Thanks for the fantastic introduction!! Well made video!
@Techning Жыл бұрын
Thank you for this amazing and helpful video! It was a good entry point for me on my way to move from GANs to Diffusion Models for my future research during my PhD.
@outliier Жыл бұрын
I love to hear that! Good luck with your PhD!
@DiTo972 жыл бұрын
Could you elaborate more the chanining of alphas in the forward process, q(x_t|x_{t - 1}), from 13:13 onwards?
@FLLCI2 жыл бұрын
Absolutely brilliant coverage! Keep up the good work. You are helping a lot of people.
@yahuiz78772 жыл бұрын
awesome explanations!! look forward to more brilliant tutorial/explanation vids!!
@sergiomanuel22062 жыл бұрын
The video is perfect! Thank you so much. You helped me to understand better all the formulation! Thanks again!!
@statixvfx17932 жыл бұрын
Great explanation, thank you for sharing your knowledge! Subscribed!
@stevemurch32452 жыл бұрын
Very well done. Animations are super helpful and the math explanation is clear.
@fahim786112 жыл бұрын
Greatly explained the papers and it's depend topics 👏👏👏