Transformer Neural Networks - EXPLAINED! (Attention is all you need)

  Рет қаралды 831,070

CodeEmporium

CodeEmporium

Күн бұрын

Please subscribe to keep me alive: www.youtube.co...
BLOG: / dataemporium
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.ne...
📕 Calculus: imp.i384100.ne...
📕 Statistics for Data Science: imp.i384100.ne...
📕 Bayesian Statistics: imp.i384100.ne...
📕 Linear Algebra: imp.i384100.ne...
📕 Probability: imp.i384100.ne...
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.ne...
📕 Python for Everybody: imp.i384100.ne...
📕 MLOps Course: imp.i384100.ne...
📕 Natural Language Processing (NLP): imp.i384100.ne...
📕 Machine Learning in Production: imp.i384100.ne...
📕 Data Science Specialization: imp.i384100.ne...
📕 Tensorflow: imp.i384100.ne...
REFERENCES
[1] The main Paper: arxiv.org/abs/...
[2] Tensor2Tensor has some code with a tutorial: www.tensorflow...
[3] Transformer very intuitively explained - Amazing: jalammar.github...
[4] Medium Blog on intuitive explanation: / what-is-a-transformer
[5] Pretrained word embeddings: nlp.stanford.e...
[6] Intuitive explanation of Layer normalization: mlexplained.co...
[7] Paper that gives even better results than transformers (Pervasive Attention): arxiv.org/abs/...
[8] BERT uses transformers to pretrain neural nets for common NLP tasks. : ai.googleblog....
[9] Stanford Lecture on RNN: cs231n.stanford...
[10] Colah’s Blog: colah.github.i...
[11] Wiki for timeseries of events: en.wikipedia.o...)

Пікірлер: 701
@CodeEmporium
@CodeEmporium Жыл бұрын
For more details and code on building a translator using a transformer neural network, check out my playlist "Transformers from scratch": kzbin.info/www/bejne/h3Stgnpqedp7ipI
@ThisNameWasntTaken
@ThisNameWasntTaken 5 жыл бұрын
what a hugely underrated video. You did such a better job at explaining this on multiple abstraction layers in such a short video than most videos I could find on the topic which were more than twice as long.
@CodeEmporium
@CodeEmporium 5 жыл бұрын
Thanks a ton Jeffrey! Means a lot. I've come to realize (fairly recently) that only speaking in jargon isn't going to help. Pealing it down from highly abstract to more technical goes a long way for viewers and myself. I understand more when I break the jargon down. Using this more in future videos
@ThisNameWasntTaken
@ThisNameWasntTaken 5 жыл бұрын
@@CodeEmporium Of course everyone has a different approach to understanding a topic. I recently had to get into a few topics quite quickly and fore me the best way of getting there fast was to start out with very general videos to get a sort of feel for the general ideas and how everything works together on a high level. Then I would watch some more detailed videos or switch to reading more detailed articles until finally reading the actual papers and looking at the formulas and all that stuff. Having an understanding of the bigger picture helped me comprehend the details better. I Also think no explanation can ever be "too simple" cause sometimes when explanations try to save time by glossing over parts or taking things for granted you spend way more time rewinding trying to wrap your head about some small detail just because you might be missing some needed knowledge. I think in an explanation it's like with spices on food. better keep it simple and easy. Individuals can always skip parts for themselves. same with the spices: Better not add too much thinking everyone will be able to take it, rather add a little and if it's not enough for someone they can add it themselves.
@bhargavyagnik
@bhargavyagnik 4 жыл бұрын
So true man !! i scraped the net to find a simple explanation !! you are a genius :)
@joosthorskamp1736
@joosthorskamp1736 4 жыл бұрын
True, I did not understand the use of attention until watching this video
@sindhutirth
@sindhutirth 4 жыл бұрын
@@CodeEmporium Great approach that you've taken. A high level understanding followed by deeper understanding of the topic pretty much clears up the concept. Subscribed.
@PhilbertLin
@PhilbertLin 4 жыл бұрын
Great video! Watched it a few times already so these timestamps will help me out: 0:00 Problems with RNNs and LSTMs 3:34 First pass overview over transformer architecture 8:10 Second pass with more detail 10:34 Third pass focusing on attention networks and normalization 11:57 Other resources (code & blog posts)
@CodeEmporium
@CodeEmporium 4 жыл бұрын
Thanks for this! It'll help others watching too.
@akkipant
@akkipant 4 жыл бұрын
@@CodeEmporium Pin this comment.
@mattcoakes5682
@mattcoakes5682 4 жыл бұрын
Thank you so much! I planned to watch this a few times for reference as I delve into transformer code. This will be very useful.
@danbochman
@danbochman 4 жыл бұрын
Wow. I've seen lectures that are 45m+ long trying to understand this architecture, even lectures from the original authors. Your video was hands-down the best, really helped me piece some key missing intuition pieces together. You have a gift for teaching and explaining -- I wholeheartedly hope you're able to leverage that in your professional career!
@Elanus19
@Elanus19 4 жыл бұрын
Incredibly well explained and concise. I can't believe you pulled off such a complete explanation in just 13 minutes!
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thank you for the kind words. Super glad you liked it :)
@ShotReverseShot
@ShotReverseShot 3 жыл бұрын
2:03 I died at the Vsauce reference. Well played.
@kekmeister42
@kekmeister42 3 жыл бұрын
Me too, I had to scroll back around 30s because he just continued explaining and I completely lost focus :D
@as5728-h1i
@as5728-h1i 3 жыл бұрын
:v
@gearoidmurphy4988
@gearoidmurphy4988 4 жыл бұрын
The multi-pass approach to progressively explaining the internals worked well. Thanks for your content!
@ajcosta
@ajcosta 2 жыл бұрын
The understanding converges!
@frederikbrammer
@frederikbrammer 2 жыл бұрын
This is by far the best explanation of the Transformers architecture that I have ever seen! Thanks a lot!
@lingding77
@lingding77 3 жыл бұрын
I love the multi-pass way of explanation so that the viewer can process high level concepts and then build upon that knowledge, great job.
@HIMANSHUKUMARSINHA7
@HIMANSHUKUMARSINHA7 2 жыл бұрын
I bought udemy course for Transformer and BERT but with no help and wasted my time, money and energy. This video and your BERT video made my day. thanks. I may explain in my interview well. :)
@MrKfirlevi
@MrKfirlevi 3 жыл бұрын
Great video!! I am taking a course in my university and one of the lectures was about RNNs and transformers. Your video of 13 mins explains way better than the 100 mins lecture i attended. Thank you!
@jonathanburrell7055
@jonathanburrell7055 Жыл бұрын
This is awesome!!! Thank you for breaking it down concisely, understandably, and deeply! It’s hard to find explanations that aren’t so simplistic they’re useless, or so involved they don’t save time in achieving understanding. Thank you!!
@CodeEmporium
@CodeEmporium Жыл бұрын
My pleasure! If you are into building the transformer piece by piece from scratch, I suggest checking out the “Transformers from scratch” playlist.
@snehashishpaul2740
@snehashishpaul2740 2 жыл бұрын
I had to make 2 passes of your video to fully understand and appreciate the underlying mathematics and working of the model. You have put a great effort in making it simpler to understand with illustration and animation.
@newginsam670
@newginsam670 10 ай бұрын
Bro TBH no words to appreciate such a well structured video in a short time and the explanation was easly understandable even for people with less knowledge. Thanks for the video man.
@shipan5940
@shipan5940 3 жыл бұрын
By far, the MOST comprehensible explaination on Transformer available in the whole internet space.
@shipan5940
@shipan5940 3 жыл бұрын
You deserve 1M subscribers at least.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thank you for the kind words! Maybe one day
@paragrk1
@paragrk1 3 жыл бұрын
Went through several videos on 'Attention is all you need' paper before this, all the details you managed to cover in thirteen minutes is amazing. Could not find explanation that is so easy to understand anywhere else. Great job!
@enjakuro7048
@enjakuro7048 3 жыл бұрын
right? I couldn't believe this video is only 13 minutes! That's a very good talent to have.
@vtrandal
@vtrandal 2 жыл бұрын
With videos like this one you should be having 100,000+ subscribers soon. Adding a bit of humor to uncompromising technical content is a very good way to go.
@adwaitpatil8300
@adwaitpatil8300 11 ай бұрын
One of the cleanest explanation for transformers without dabbling too much into the theory!! Thanks man
@finnberuldsen4798
@finnberuldsen4798 Жыл бұрын
Wonderful video, this clears up so much for me.
@CodeEmporium
@CodeEmporium Жыл бұрын
Glad! I am currently building a transformer from scratch for additional context in my playlist “Transformers from scratch “
@imagiro1
@imagiro1 Жыл бұрын
Thank you very much for you effort! You just adjusted my attention layer, the pieces start to fall in place and I have a much better understanding of why TNNs are so revolutionary.
@KrazeeKrab
@KrazeeKrab Жыл бұрын
This was a phenomenal video. You managed to explain transformers in 13 minutes better than my professor could in three hours. Thank you and keep on creating content!
@moneyinahurry
@moneyinahurry 2 жыл бұрын
One of the best or probably the best explanation I've seen. Thank you very much for the effort.
@ryanhewitt9902
@ryanhewitt9902 2 жыл бұрын
Thank you for making this! As a curious outsider I have been anxious about falling behind in recent years and this was perfect to bring me up to speed - at least enough to follow the conversation.
@CodeEmporium
@CodeEmporium 2 жыл бұрын
You are very welcome! Thanks for watching! :)
@pipe_runner_lab
@pipe_runner_lab 2 жыл бұрын
I saw Yanik's explaination and now I saw yours. Yanik does a terrible job at explaining papers, he usually just jokes around. Your explanation is probably one of the best I have seen so far. Thanks man.
@PRUTHVIRAJRGEEB
@PRUTHVIRAJRGEEB 5 жыл бұрын
This is heavily underrated!! Such an awesome video! Thanks Man!
@langlansrobert8599
@langlansrobert8599 3 жыл бұрын
The easiest understanding video about this topic, as I can see! Thank you
@CodeEmporium
@CodeEmporium 3 жыл бұрын
You are very welcome :)
@simonevagnoni1758
@simonevagnoni1758 3 жыл бұрын
Your way to breaking down step by step is very effective! Congrats and thanks. School systems should use it more
@NicholasRenotte
@NicholasRenotte 3 жыл бұрын
Oh man I’m going through this now! Didn’t even realise you had a vid on it, this is brilliant. Love that you did a bird’s eye pass then dove in.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
You're saying you liked my content without watching this video? You must be a true fan of mine mwahahaa..also thanks for the kind words :)
@NicholasRenotte
@NicholasRenotte 3 жыл бұрын
@@CodeEmporium I mean, tbf, I clicked like before watching it because it's you, then I was like dayyyyuuumm this is great.
@arshiavashisht3481
@arshiavashisht3481 Жыл бұрын
Had to watch it like 2-3 times to get the entire thing, but worth it. Somehow so concise, yet every relevant detail is included.
@CodeEmporium
@CodeEmporium Жыл бұрын
Thanks a lot for watching and commenting! Super happy this video gave you value
@wangy01
@wangy01 2 жыл бұрын
I watched this video four times. After each time, I feel I understand this topic better than the previous one.
@navinahmed
@navinahmed 4 жыл бұрын
Didnt know what a transformer hype was until I landed on this video. Thanks a lot ! Subscribed. Gotta check more content on this channel now
@NoOne-sy5fg
@NoOne-sy5fg 2 жыл бұрын
Great video bro! You're underrated af. One of the best if not the best explanation of some neural network architecture. Keep up the good work. Kudos!
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks so much! I am making more related videos. So do check ‘em out :)
@ScriptureFirst
@ScriptureFirst 4 жыл бұрын
KEEP IT UP! Please! This is outstanding :) love the diagrams, dry crisp active speaking, & overview technique.
@seungjungjin9217
@seungjungjin9217 4 жыл бұрын
Great video! I love how you go through multiple passes, each getting into deeper specifics!
@ahmedelayek2110
@ahmedelayek2110 3 жыл бұрын
what a guide for the transformer in just 13 min. thanks a lot for this simplicity.
@himeshph
@himeshph 4 жыл бұрын
@2:02 that vsauce thing was cool
@ShenalPerera
@ShenalPerera 4 жыл бұрын
Ikr 😅
@blakef.8566
@blakef.8566 4 жыл бұрын
...or was it?
@aryamaanjena1706
@aryamaanjena1706 4 жыл бұрын
Love that so many people got the vsauce ref
@ozgurakpinar_gr
@ozgurakpinar_gr 4 жыл бұрын
I came here to say that. Excelsior
@Sidnv
@Sidnv 3 жыл бұрын
Really great video. As someone transitioning from pure math into machine learning and AI, I find the language barrier to be the biggest hurdle and you broke down these concepts in a really clear way. I love the multiple layer approach you took to this video, I think it worked really well to first give a big picture overview of the architecture before delving deeper.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Super happy this helped!
@karakadir8860
@karakadir8860 11 ай бұрын
dude you absolutely deserve each and any subsriber. thank you very much for your highly helpful and quality content.
@CodeEmporium
@CodeEmporium 11 ай бұрын
Thanks so much for the lovely comment! And also for subscribing! More to come!
@thegt
@thegt Жыл бұрын
This might be the best explanation for someone who has some experience with ANN and CNN but want to understand Transformers. Thanks!
@CodeEmporium
@CodeEmporium Жыл бұрын
Thanks for the kind words!
@vaishnavvaidheeswaran3692
@vaishnavvaidheeswaran3692 3 ай бұрын
I paused every 10 seconds and took notes, such an excellent video!
@dannysuarez6265
@dannysuarez6265 4 жыл бұрын
How is it possible such a beautiful video don't have more views/likes? Thank you CodeEmporium:)
@logicloudy2851
@logicloudy2851 4 жыл бұрын
Thanks, man. This is a really clear and high-level explanation. Really helpful for some guys like me who just stepped into this area. I read many explanations online. They give tons of details but fail in explaining these abstract items. These explanations always use other abstract definitions to explain this one. This problem happens again in the explanation of the "other abstract item". Sometimes I just forgot originally what I want to understand. Or even worse, they form a circulation... Thank you so much! This video helped me a lot in understanding the paper
@tarat.techhh
@tarat.techhh 4 жыл бұрын
2:04 or are we... not gonna lie this is the best channel and best explanation ever....................
@colorlace
@colorlace 4 жыл бұрын
The only thing that surprised me more than this high quality explanation is how you pronounce "mahtrix"
@CodeEmporium
@CodeEmporium 4 жыл бұрын
We live in the mahtrix... . . Sorry I'll leave. . . Psych. I'm staying since it's my channel. Haha. . . Please don't leave (Thanks for watching and reading this pointless rant)
@mattcoakes5682
@mattcoakes5682 4 жыл бұрын
LOVE the multipass strategy for explaining the architecture. I don't think I've seen this approach used with ML, and it's a shame as this is an incredibly useful strategy for people like me trying to play catch up. I hopped on the ML train a little late, but stuff like this makes me feel not nearly as lost.
@derrxb
@derrxb 4 жыл бұрын
This is one of the best explanations for transformers I've come across online! Awesome job, man! Thanks. I'll totally recommend your channel to some classmates!! :)
@CodeEmporium
@CodeEmporium 4 жыл бұрын
Thanks! And Glad it's helpful! Spreading the word of my channel is the best thing you can do to help :)
@aram69420
@aram69420 10 ай бұрын
I don't know how many attention video I have watched. But I swear I'm getting there. Repetition is the key to understanding these shit
@ApprovingSeal
@ApprovingSeal 2 жыл бұрын
Finally a basic explanation I can understand. I tried reading the original "Attention is all you need" paper, but it felt like it was assuming I was already familiar with the basics of NLP, like the encoder-decoder setup. Which I wasn't.
@cPho3nix
@cPho3nix 4 жыл бұрын
That was the best way of explaining thins in my opinion. Start big picture, getting more detailed over time.
@paramveersingh2919
@paramveersingh2919 3 жыл бұрын
Watched Andrew Ng, watched this, you got me to stick through the video and Andrew who i consider one of the best in this field did not manage to express as clearly as you did! Cheers man, amazing video!
@tictacX1
@tictacX1 4 жыл бұрын
Good job CodeEmporium! Very well made overview. thanks.
@Fruchtkotzekiddy
@Fruchtkotzekiddy 4 жыл бұрын
this video was one of the best learning videos i EVER SAW first you give a high level overview, then u step in deeper every step with an understandable example THANK YOU SO MUCH!!!
@CodeEmporium
@CodeEmporium 4 жыл бұрын
You are very welcome. Thank you for the compliments :)
@klam77
@klam77 Жыл бұрын
BEAUTIFUL......deep, concise, pithy, each word is meaningful.....well done.
@arunimachakraborty1175
@arunimachakraborty1175 Жыл бұрын
watching your videos for a last min revision. You're awesome
@CodeEmporium
@CodeEmporium Жыл бұрын
Awesome! And Thank you! I do not deserve such compliments
@bauwndule
@bauwndule 3 жыл бұрын
Best explanation ever. I have an interview with Microsoft tomorrow. This was the best brushing up I could get.
@tvanpeer
@tvanpeer Жыл бұрын
Great video! I love the layered approach for explaining the concepts. Very well done. Thank you!
@CodeEmporium
@CodeEmporium Жыл бұрын
I am super glad that approach helped
@vahekassardjian5032
@vahekassardjian5032 4 жыл бұрын
Outstanding explanations: to the point and well illustrated. Thank you.
@TheJonathanLugo
@TheJonathanLugo Жыл бұрын
Wow, I am so glad I found your channel. The concept is clearly explained and assumes an intelligent audience. Well done!
@jasminelee1720
@jasminelee1720 3 жыл бұрын
Really great video. I have taken so many classes that have discussed this model and never understood it until now. Thanks!
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Perfect! Glad this helped!
@rahulchowdhury9739
@rahulchowdhury9739 2 жыл бұрын
I am half-way through this video. I have not finished it. But I felt giving a like before watching the next half. Thank you so much.
@atwinemugume
@atwinemugume 3 жыл бұрын
I am now more confident to talk about transformers, it had been abstract to me before. Thank you so much, great explanation.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Glad and you're welcome! :)
@gurudevilangovan
@gurudevilangovan 4 жыл бұрын
Wow. Amazing video. Better than anything I've watched on the topic, all in thirteen minutes.
@lonewolf2547
@lonewolf2547 4 жыл бұрын
I cant believe you explained so complicated things in crystal clear format. Excellent job dude
@VaibhavPatil-rx7pc
@VaibhavPatil-rx7pc Жыл бұрын
TOP OF TOP clear explanation you provided !!!
@d63810728
@d63810728 4 жыл бұрын
This is by far the most comprehensive yet short video i haave seen
@mr.dineshlee
@mr.dineshlee 2 жыл бұрын
I watched many videos related to this topic, but this video taught me easiest way...
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks for watching! And for the compliments :) I try my best for my more recent videos too
@emeralde3761
@emeralde3761 2 жыл бұрын
Just want to say this video is amazing. Watched like three other 30+ mins videos but they all failed to train my stupid brain. This 13 minutes video is intuitive, detailed, and beginner-friendly. Thank you :3
@lisandrocesaratto3012
@lisandrocesaratto3012 3 жыл бұрын
Best video on Transformers I have seen so far! The examples really help to understand how the architecture works. Subscribed.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thank you for watching!
@Mewgu_studio
@Mewgu_studio 2 жыл бұрын
Really appreciate how the explanation is going from high level and down, thanks.
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Super glad you enjoyed this :)
@pathikghugare9918
@pathikghugare9918 2 жыл бұрын
Its been a year I still come back to this video to revise my transformer knowledge :) Thanks man!
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks for coming back :)
@arrasuraparamesh1090
@arrasuraparamesh1090 2 жыл бұрын
A very good explanation about transformers, Thank you :)
@teetanrobotics5363
@teetanrobotics5363 5 жыл бұрын
Best Channel for ai and ml on KZbin
@ganjarulez009
@ganjarulez009 2 жыл бұрын
Really nice explanation, allthough even after the video most of Transformers still seem like a big magic black-box. Fascinating stuff
@bloolizard
@bloolizard 4 жыл бұрын
Good stuff, best explanation found so far, had been so confusing reading through the jargon on other sites.
@CodeEmporium
@CodeEmporium 4 жыл бұрын
Glad this was useful. The idea was to avoid as much jargon as possible. And if used, make sure it's explained
@danielpwagner
@danielpwagner 2 жыл бұрын
Outstanding. Best explanation of transformers that I’ve seen by far.
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks a ton for watching :)
@anandg4286
@anandg4286 Жыл бұрын
Thanks!
@CodeEmporium
@CodeEmporium Жыл бұрын
Thanks for the donation! And you are very welcome! :)
@Themojii
@Themojii 2 жыл бұрын
Very well, clearly explained the concepts, and nice visualization video. I spent a couple of hours and read multiple blog/tutorial about transformers, but I learned a lot more from your 13 mins video compared to those tutorials. Great job. I subscribed to your channel after watching this. Keep up the good work
@anirudha_ani
@anirudha_ani 3 жыл бұрын
Best explanation ! "Multi pass" way of explaining is genius.
@alexanderk5835
@alexanderk5835 3 жыл бұрын
Hey, thanks a lot, the explanation is great, the video explains much clearer than the lecture in the university.
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thanks for the high praise
@theneilpowers
@theneilpowers Жыл бұрын
This earned a subscription! Excellent explanation!
@prateek4546
@prateek4546 2 жыл бұрын
Wow, the best explaination on youtube ! Had to subscribe after watching !
@M0I0D
@M0I0D 4 жыл бұрын
너무 감사합니다ㅠㅜㅜ 덕분에 확 이해가네요. 배뎃 쌉공감 대체 이거 왜이렇게 안 유명하냐!!! thank you for your clear explanation!! This is what I was looking for!!!!!!
@ritwikdubey5331
@ritwikdubey5331 Жыл бұрын
I was searching for this particular explanation from a long tym! thanks for this!
@autismo1969
@autismo1969 Жыл бұрын
that was an amazing content heavy explanation for just 13 minutes. thanks a lot!
@CodeEmporium
@CodeEmporium Жыл бұрын
You are super welcome!
@goncalomarques251
@goncalomarques251 4 жыл бұрын
Sir, you just earned yourself a like and subscription for this amazing video. My background is Mechanical Engineering, but I was still able to easily follow each step. Thanks man!
@CodeEmporium
@CodeEmporium 4 жыл бұрын
Perfect! Glad this was useful to you. And thanks for your background info. Helps to know my audience to tailor these videos
@ishwarinalgirkar1193
@ishwarinalgirkar1193 Жыл бұрын
great job putting attention in simple words. Very intuitive!
@CodeEmporium
@CodeEmporium Жыл бұрын
Thanks so much!
@TusharKale9
@TusharKale9 3 жыл бұрын
Superb explanation in 13 minutes. I have been watching videos over 1 hour long to get this concept. Well done and keep it up. Regards
@praveenrajagopal9161
@praveenrajagopal9161 3 жыл бұрын
Brilliant piece of work! You nailed it in few mins!
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thanks for watching!
@rodrigoklosowski8219
@rodrigoklosowski8219 4 жыл бұрын
Amazing! The best explanation on youtube about Transformer Neural networks, the matrices visual representations helps a lot!
@Ashwin436
@Ashwin436 3 жыл бұрын
This was very helpful! You broke it down into such simpler concepts. I'm sure I'll be needing you again. Please keep at it. Thanks!
@eashwaraerahan861
@eashwaraerahan861 4 жыл бұрын
Great work. I really like the info graphics. I’m a person with no background on NLP but I was still able to follow till the second pass, thanks to your great work.
@usamahussain4461
@usamahussain4461 2 жыл бұрын
Man! Brilliant video. I saw a 27 mins video and was totally spent out and didn't even understand much. But this was just awesome, and in half the time! The only thing lacking might be the examples of keys, values and queries but i mostly got the hang of it.
@melongarb
@melongarb Ай бұрын
Ah, so that’s what Q, V, and K are.
@gokuson6399
@gokuson6399 3 жыл бұрын
Best explanation so far! Keep up the good work!
@utsabkhakurel9742
@utsabkhakurel9742 Жыл бұрын
Simple and easy to understand. Great job!
@firesongs
@firesongs 2 жыл бұрын
Even though this definitely assumes some intermediate working level knowledge of ML, best layman explanation on Transformers/Attention so far.
@abhishekbhatia651
@abhishekbhatia651 4 жыл бұрын
How do you pass the output tokens when they are something we want to predict? I think you pass whatever is generated as output, and since nothing is generated in the start, you pass the token.
@graywire1684
@graywire1684 3 жыл бұрын
Best channel on the topic! Soo glad that I found this channel
@CodeEmporium
@CodeEmporium 3 жыл бұрын
Thanks a ton :)
@sairamsubramaniam8316
@sairamsubramaniam8316 4 жыл бұрын
This is the best explanation! I came in search for transformers but I found Gold.
@animist_avery
@animist_avery 3 жыл бұрын
This video presents such a natural logical flow. It is very satisfying to watch. Could someone help me with some answers to these questions? Answers to these would really help orient me so I could then understand the deeper points of the video. 1) This process of feeding the English and the French words is training right? (as opposed to the process of using the model to calculate a translation desired by the user) 2) Assuming that yes, this video is about training, at what point do you say to the algorithm "The next French word was actually supposed to be 'chien'. Learn from that!" 3) Is this video an example of an NSP training? How would it look different if it were MLM? I know what MLM is, but I'm asking more from a practical standoint, like what would you feed to the algorithm and at what time. Really great video nonetheless, I can tell it would be perfect if I just had a little more background (which I am working to develop)
@leoisikdogan
@leoisikdogan 5 жыл бұрын
Very well explained! Great video, as always.
BERT Neural Network - EXPLAINED!
11:37
CodeEmporium
Рет қаралды 423 М.
Transformers Explained | Simple Explanation of Transformers
57:31
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН
Inside a Swiss duo's ambitious mission to clean up space
12:47
CNBC International
Рет қаралды 1,9 М.
Multi Head Attention in Transformer Neural Networks with Code!
15:59
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Рет қаралды 596 М.
The math behind Attention: Keys, Queries, and Values matrices
36:16
Serrano.Academy
Рет қаралды 284 М.
10 weird algorithms
9:06
Fireship
Рет қаралды 1,3 МЛН
Attention Is All You Need
27:07
Yannic Kilcher
Рет қаралды 671 М.
Transformer Neural Networks, ChatGPT's foundation, Clearly Explained!!!
36:15
StatQuest with Josh Starmer
Рет қаралды 818 М.
Attention in transformers, step-by-step | DL6
26:10
3Blue1Brown
Рет қаралды 2,1 МЛН
How I'd learn ML in 2025 (if I could start over)
16:24
Boris Meinardus
Рет қаралды 230 М.
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН