Foundation of Q-learning | Temporal Difference Learning explained!

  Рет қаралды 10,319

CodeEmporium

CodeEmporium

6 ай бұрын

Let's talk about the foundation concept of Q-learning, SARSA called Temporal Difference Learning.
ABOUT ME
⭕ Subscribe: kzbin.info...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajhalthor
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
[2] Paradigms of ML: idapgroup.com/blog/types-of-m...
[3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
[4] Bellman Equation video: • Bellman Equation - Ex...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow

Пікірлер: 18
@PrymeOrigin
@PrymeOrigin 5 ай бұрын
You have a gift to teach and I'm very thankful to find someone who breaks down concepts so simply and easy to digest
@CodeEmporium
@CodeEmporium 5 ай бұрын
Thanks so much for the kind words. I really appreciate this
@magroubezpieczeniasp.zo.o.2137
@magroubezpieczeniasp.zo.o.2137 5 ай бұрын
Totally agree!
@LuthandoMaqondo
@LuthandoMaqondo 6 ай бұрын
Nice, quick and straight to the point.
@noahgsolomon
@noahgsolomon 26 күн бұрын
The breakdown of the 1 sentence explanation is so useful
@al_parlam
@al_parlam 4 ай бұрын
man, your explanation is gorgeous ! you are remarkable in explaining complex things. Keep doing what you are doing :) I wish you much luck with your channel
@LaveshNK
@LaveshNK 2 ай бұрын
Fantastic video...I have a RL assignment due and I had no idea wht TD error even meant. You are great at explaining
@akshaypansari111111
@akshaypansari111111 6 ай бұрын
Thanks a lot. This is real helpful. I will check out the bellman equation video as well
@li-pingho1441
@li-pingho1441 6 ай бұрын
awesome explanation!
@krzysztofjarek6476
@krzysztofjarek6476 6 ай бұрын
Great lecture 😉
@minapagliaro7607
@minapagliaro7607 Ай бұрын
Great video !!!!
@krishnavinukonda1882
@krishnavinukonda1882 Ай бұрын
This is best . Thanks!
@slitihela1860
@slitihela1860 3 ай бұрын
can you prepare a video for Double Q-Learning Network and Dueling Double Q-Learning Network please
@davidlieber3494
@davidlieber3494 5 ай бұрын
great video, thanks!
@CodeEmporium
@CodeEmporium 5 ай бұрын
You are very welcome. Thanks for commenting
@yep3659
@yep3659 2 ай бұрын
I'm craving for some Tempuras now
@redrose5406
@redrose5406 6 ай бұрын
Post more about GANs
@satyamdubey4110
@satyamdubey4110 3 ай бұрын
💖💖
Q-learning - Explained!
11:54
CodeEmporium
Рет қаралды 11 М.
Self Attention in Transformer Neural Networks (with Code!)
15:02
CodeEmporium
Рет қаралды 77 М.
Зу-зу Күлпәш. Стоп. (1-бөлім)
52:33
ASTANATV Movie
Рет қаралды 1,2 МЛН
ХОТЯ БЫ КИНОДА 2 - официальный фильм
1:35:34
ХОТЯ БЫ В КИНО
Рет қаралды 1,5 МЛН
Multi-Armed Bandit : Data Science Concepts
11:44
ritvikmath
Рет қаралды 81 М.
Reinforcement Learning: on-policy vs off-policy algorithms
14:47
CodeEmporium
Рет қаралды 5 М.
Bellman Equation -  Explained!
9:05
CodeEmporium
Рет қаралды 13 М.
Monte Carlo And Off-Policy Methods | Reinforcement Learning Part 3
27:06
Mutual Information
Рет қаралды 34 М.
SARSA vs Q Learning
16:31
Marcus Fong
Рет қаралды 10 М.
Monte Carlo in Reinforcement Learning
11:49
CodeEmporium
Рет қаралды 7 М.