Layer Normalization in Transformers | Layer Norm Vs Batch Norm

  Рет қаралды 8,464

CampusX

CampusX

Күн бұрын

Layer Normalization is a technique used to stabilize and accelerate the training of transformers by normalizing the inputs across the features. It adjusts and scales the activations, ensuring consistent output distributions. This helps in reducing training time and improving model performance, making it a key component in transformer architectures.
Share your thoughts, experiences, or questions in the comments below. I love hearing from you!
============================
Did you like my teaching style?
Check my affordable mentorship program at : learnwith.campusx.in
DSMP FAQ: docs.google.com/document/d/1O...
============================
📱 Grow with us:
CampusX' LinkedIn: / campusx-official
CampusX on Instagram for daily tips: / campusx.official
My LinkedIn: / nitish-singh-03412789
Discord: / discord
E-mail us at support@campusx.in
✨ Hashtags✨
#deeplearning #campusx #transformers #transformerarchitechture
⌚Time Stamps⌚
00:00 - Intro
02:20 - What is Normalization
03:50 - What do we normalize?
05:30 - Benefits of Normalization in DL
07:10 - Internal Covariate Shift
12:49 - Batch Normalization Revision
22:56 - Why don't we use Batch Norm in Transformers?
38:25 - How does Layer Normalization works?
43:00 - Layer Normalization in Transformer

Пікірлер: 101
@abhisheksaurav
@abhisheksaurav Ай бұрын
This playlist is like a time machine. I’ve watched you grow your hair from black to white, and I’ve seen the content quality continuously improve video by video. Great work!
@animatrix1631
@animatrix1631 Ай бұрын
I feel the same as well but I guess he's not that old
@zerotohero1002
@zerotohero1002 5 күн бұрын
Courage comes at a price ❤
@muhammadsheraz177
@muhammadsheraz177 Ай бұрын
Please end this playlist as early as possible
@RamandeepSingh_04
@RamandeepSingh_04 17 күн бұрын
Another student added in the waiting list demanding for next video. Thank you sir.
@ayushrathore2570
@ayushrathore2570 26 күн бұрын
This whole playlist is the best thing I discovered on KZbin! Thank you so much, sir
@yashshekhar538
@yashshekhar538 26 күн бұрын
Respected Sir, your playlist is the best. Kindly increase the frequency of videos.
@AidenDsouza-ii8rb
@AidenDsouza-ii8rb 2 күн бұрын
Your DL playlist is like a thrilling TV series - can't wait for the next episode! Any chance we could get a season finale soon? Keep up the awesome work!
@akeshagarwal794
@akeshagarwal794 Ай бұрын
Congratulations for building a 200k Family you deserve even more reach🎉❤ We love you sir ❤
@rajnishadhikari9280
@rajnishadhikari9280 Ай бұрын
Thanks for this amazing series.
@amitmehraa
@amitmehraa 9 күн бұрын
please complete this this playlist and add transformers tutorials as soon as possible
@shreeyagupta5720
@shreeyagupta5720 Ай бұрын
Congratulations for 200k sir 👏 🎉🍺
@vinaykumar-xh5pi
@vinaykumar-xh5pi 6 күн бұрын
please release the next video very curious to complete ...... loved your content as always
@sahil5124
@sahil5124 Ай бұрын
this is really important topic. Thank you so much. Please cover everything about Transformer architecture
@user-nc8nc3lj1c
@user-nc8nc3lj1c 25 күн бұрын
Sir try to complete this playlist as early as possible , you are the best teacher and we want to learn the deep learning concept from you
@krisharora2959
@krisharora2959 4 күн бұрын
Next video is awaited more than anything
@arpitpathak7276
@arpitpathak7276 Ай бұрын
Thank you sir I am waiting for this video ❤
@GanitSikho-xo2yx
@GanitSikho-xo2yx 13 күн бұрын
Well, I am waiting for your next video. It's a gem of learning!
@ghousepasha4172
@ghousepasha4172 9 күн бұрын
Please sir update videos regularly, we wait a lot for your videos
@ai_pie1000
@ai_pie1000 Ай бұрын
Congratulations Brother for 200k users Family ... 👏👏👏
@mayyutyagi
@mayyutyagi Ай бұрын
Amazing series full of knowledge...
@rb4754
@rb4754 Ай бұрын
Congratulations for 200k subscribers!!!!!!!!!!!!!!!!!!
@bmp-zz9pu
@bmp-zz9pu 8 күн бұрын
SIr krdo pls iss playlist ko poora!!!!!!!!!
@sachink9102
@sachink9102 11 күн бұрын
Thank you, NitishJi, Eeagerly waiting to attend your Transformers sessions. Please complete this series.
@nvnurav1892
@nvnurav1892 6 күн бұрын
Sir one small suggestion, aap apni videos pe speech to speech translation laga ke english mai convert kar lo and upload it on Udemy/youtube. it will help a lot of people jinko hindi nhi aati and will help your hard work get more and more attraction.🙂🙂. We are really very lucky that we are getting such rich content for free.. God bless you.
@znyd.
@znyd. Ай бұрын
Congrats on the 200k subs, love from Bangladesh ❤.
@dharmendra_397
@dharmendra_397 Ай бұрын
Very nice video
@muhammadsheraz177
@muhammadsheraz177 Ай бұрын
Sir kindly can you tell that when this playlist will complete.
@hassan_sid
@hassan_sid Ай бұрын
It would be great if you make a video on RoPE
@shibrajdeb5177
@shibrajdeb5177 25 күн бұрын
sir please upload regular video . This videos help me a lot. please sir upload regular videos
@AmitBiswas-hd3js
@AmitBiswas-hd3js 14 күн бұрын
Please cover this entire Transformer architecture as soon as possible
@1111Shahad
@1111Shahad 25 күн бұрын
Thank you Nitish, Waiting for your next upload.
@not_amanullah
@not_amanullah 24 күн бұрын
This is helpful 🖤
@taseer12
@taseer12 Ай бұрын
Sir I can't describe your efforts Love from Pakistan
@rose9466
@rose9466 Ай бұрын
Can you give an estimate by when this playlist will be completed
@not_amanullah
@not_amanullah 24 күн бұрын
Thanks ❤
@saurabhbadole821
@saurabhbadole821 27 күн бұрын
I am glad that I found this Channel! can't thank you enough, Nitish Sir! One more request: If you could create one-shot revision videos for machine learning, deep learning, and natural language processing (NLP).🤌
@princekhunt1
@princekhunt1 3 күн бұрын
Sir, Please complete this series.
@Amanullah-wy3ur
@Amanullah-wy3ur 15 күн бұрын
thanks ❤
@gurvgupta5515
@gurvgupta5515 28 күн бұрын
Thanks for this video sir. Can you also make a video on Rotary Positional Embeddings (RoPE) that is used in Llama as well as other LLMs for enhanced attention.
@sachinpatodia4593
@sachinpatodia4593 2 күн бұрын
Can you pls explain what is the add in add and norm layer?
@advaitdanade7538
@advaitdanade7538 Ай бұрын
Sir please end this playlist fast placement season is nearby😢
@physicskiduniya8054
@physicskiduniya8054 16 күн бұрын
Bhaiya! Awaiting for your course upcoming videos please try to complete this playlist asap bhaiya
@29_chothaniharsh62
@29_chothaniharsh62 Ай бұрын
Sir can you please continue the 100 interview questions on ML playlist?
@WIN_1306
@WIN_1306 15 күн бұрын
at 46:10 ,why it is zero? as beta is added so it will prevent it from becoming zero?
@WIN_1306
@WIN_1306 15 күн бұрын
i am the 300th person to like this video sir plzz upload next vidoes we are eagerly waiting
@MrSat001
@MrSat001 Ай бұрын
Great 👍
@SulemanZeb.
@SulemanZeb. Ай бұрын
Please start MLOPs playlist as we are desperately waiting for.......
@darkpheonix6592
@darkpheonix6592 10 күн бұрын
please upload remaining videos quickly
@teksinghayer5469
@teksinghayer5469 Ай бұрын
when will you code transformer from scratch in pytorch
@technicalhouse9820
@technicalhouse9820 Ай бұрын
Sir love you so much from Pakistan
@sagarbhagwani7193
@sagarbhagwani7193 Ай бұрын
thanks sir plse complete this playlist asap
@intcvn
@intcvn 16 күн бұрын
complete jaldi sir waiting asf
@shubharuidas2624
@shubharuidas2624 24 күн бұрын
Please also continue with vision transformer
@manojprasad6781
@manojprasad6781 28 күн бұрын
Waiting for the next video💌
@virajkaralay8844
@virajkaralay8844 Ай бұрын
Absolute banger video again. Appreciate the efforts you're taking for transformers. Cannot wait for when you explain the entire transformer architecture.
@virajkaralay8844
@virajkaralay8844 Ай бұрын
Also, congratulations for 200k subscribers. May you reach many more milestones
@aksholic2797
@aksholic2797 Ай бұрын
200k🎉
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@space_ace7710
@space_ace7710 Ай бұрын
Yeah!!
@oden4013
@oden4013 2 күн бұрын
sir please upload next video please its almost a month
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost.
@ishika7585
@ishika7585 Ай бұрын
Kindly make video on Regex as well
@WIN_1306
@WIN_1306 15 күн бұрын
what is regex?
@peace-it4rg
@peace-it4rg 18 күн бұрын
sir mera doubt that ki mai agar transformer architecture mai batchnorm use karoon kunki jo values matrix mai hai un sabka apna learning rate and bias factor hai to jo bias hai uskai karan to zero chala hi jayega fir layer norm kyun. kyunki ham ((x-u)/var)*lambda+bias krtai hi hain to bias to apne aap usko zero nhi hone dega. Please help sir
@RamandeepSingh_04
@RamandeepSingh_04 17 күн бұрын
still it will be a very small number and will affect the result and not represent the true picture of the feature in batch normalization.
@WIN_1306
@WIN_1306 15 күн бұрын
@@RamandeepSingh_04 compared to others who are without padding it will be small, but still sir wrote zero but zero to nhi hi hoga
@vikassengupta8427
@vikassengupta8427 15 күн бұрын
Sir next video ❤❤
@zerotohero1002
@zerotohero1002 Күн бұрын
one month ho gya sir please upload eagarly waiting🥺🥺🥺
@barryallen5243
@barryallen5243 27 күн бұрын
Just ignoring padded rows while performing batch normalization should also work, I feel like it that padded zeros are not the only reason we layer normalization instead of batch normalization.
@WIN_1306
@WIN_1306 15 күн бұрын
how would you ignore padding cols in batch normalisation?
@user-mw9ny7wc6l
@user-mw9ny7wc6l 24 күн бұрын
Jldi next video dalo sir
@SANJAYTYAGI-bk6tx
@SANJAYTYAGI-bk6tx 24 күн бұрын
Sir In batch normalization , in your example we have three mean and three variance along with same number of beta and gamma i.e. 3. But in layer normalization , we have eight mean and eight variance along with 3 beta and 3 gamma. That means number of beta and gamma are same in both batch and layer normalization. Is it correct? Pl elaborate on it .
@campusx-official
@campusx-official 24 күн бұрын
Yes
@WIN_1306
@WIN_1306 15 күн бұрын
mean and variance are used for normalisation ,beta and gamma are used for scaling
@ESHAANMISHRA-pr7dh
@ESHAANMISHRA-pr7dh 13 күн бұрын
Respected sir, I request you to please complete the playlist. I am really thankful to you for your amazing videos in this playlist. I have recommended this playlist to a lot of my friends and they loved it too. Thanks for providing such content for free🙏🙏
@adarshsagar9817
@adarshsagar9817 27 күн бұрын
sir please complete the NLP playlist
@WIN_1306
@WIN_1306 15 күн бұрын
which one? how many videos does it have?
@not_amanullah
@not_amanullah 7 күн бұрын
🖤🤗
@titaniumgopal
@titaniumgopal 17 күн бұрын
Sir PDF Update karo
@ghousepasha4172
@ghousepasha4172 22 күн бұрын
Sir please complete playlist I will pay 5000 for that
@DarkShadow00972
@DarkShadow00972 Ай бұрын
Bring some coding example bro
@gauravbhasin2625
@gauravbhasin2625 Ай бұрын
Nitish, please relook at your covariate shift funds... yes, you are partially correct but how you explained covariate shift is actually incorrect. (example - Imagine training a model to predict if someone will buy a house based on features like income and credit score. If the model is trained on data from a specific city with a certain average income level, it might not perform well when used in a different city with a much higher average income. The distribution of "income" (covariate) has shifted, and the model's understanding of its relationship to house buying needs to be adjusted.)
@WIN_1306
@WIN_1306 15 күн бұрын
ig , the explanation that sir gave and your explanation are same with different example of covariate shift
@bmp-zz9pu
@bmp-zz9pu Ай бұрын
A video after 2 weeks in this playlist.....itna zulam mat karo.....thoda tez kaam kro sirji..............
@ashutoshpatidar3288
@ashutoshpatidar3288 28 күн бұрын
please be a little fast!
@WIN_1306
@WIN_1306 8 күн бұрын
sir can u tell that around how many and which topics are left?
@Amanullah-wy3ur
@Amanullah-wy3ur 15 күн бұрын
this is helpful 🖤
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
@anonymousman3014
@anonymousman3014 21 күн бұрын
Sir, is transformer architecture completed as I want to cover it ASAP, I have covered the topics till attention mechanism. I want to cover the topic in one go. Sir please tell please. And, sir I request to upload all video asap. I want to learn a lot. And thanks for the amazing course at 0 cost. God bless you.
Loss Functions in Deep Learning | Deep Learning | CampusX
59:56
Самое Романтичное Видео ❤️
00:16
Глеб Рандалайнен
Рет қаралды 4,7 МЛН
THE POLICE TAKES ME! feat @PANDAGIRLOFFICIAL #shorts
00:31
PANDA BOI
Рет қаралды 24 МЛН
Получилось у Вики?😂 #хабибка
00:14
ХАБИБ
Рет қаралды 7 МЛН
House Price Prediction Project | Linear Regression | Machine Learning
34:04
Positional Encoding in Transformers | Deep Learning | CampusX
1:13:15
Layer Normalization - EXPLAINED (in Transformer Neural Networks)
13:34
Batch normalization | What it is and how to implement it
13:51
AssemblyAI
Рет қаралды 56 М.