Q-learning - Explained!

  Рет қаралды 13,817

CodeEmporium

CodeEmporium

Күн бұрын

Let's talk about one of the more important concepts in reinforcement learning: q-learning
ABOUT ME
⭕ Subscribe: kzbin.info...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajhalthor
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
[2] Paradigms of ML: idapgroup.com/blog/types-of-m...
[3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
[4] Bellman Equation video: • Bellman Equation - Ex...
[5] Temporal Difference Learning video: • Foundation of Q-learni...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow

Пікірлер: 16
@henoknigatu7121
@henoknigatu7121 2 ай бұрын
Your 12 min video worth than all the playlist about q-learning on youtube👏
@anya_forgerrr
@anya_forgerrr 4 ай бұрын
i watched so many vids in RL, but this ones the best when it comes to explaining and breaking down the formulas 😭❤thankuskajhjhc
@arandomwho
@arandomwho 3 ай бұрын
Thanks, for your pretty efficient good quality videos! not only save time but also gives a complete understanding of topic😍
@akshaypansari111111
@akshaypansari111111 7 ай бұрын
Really enjoying the series. Keep it up
@CodeEmporium
@CodeEmporium 7 ай бұрын
Thanks so much! Super glad you are enjoying this
@tonihullzer1611
@tonihullzer1611 2 ай бұрын
very good explained, thanks a lot!
@sameertupe6094
@sameertupe6094 Ай бұрын
Very Well explained by you sir,It helped alot
@user-pb6yt8qh3w
@user-pb6yt8qh3w 27 күн бұрын
Thank you so much!!!!!!!!!!!!
@user-qu4is5uk3p
@user-qu4is5uk3p Ай бұрын
thank you so much that was so helpful
@alexanderlevakin9001
@alexanderlevakin9001 7 ай бұрын
What classical tasks are solved by off-policy algorithms? Do we use it to write bots that solves simple computer games?
@justsomegirlwithoutamustac5837
@justsomegirlwithoutamustac5837 2 ай бұрын
This is so underrated
@burakkurt1907
@burakkurt1907 11 күн бұрын
Allah razı olsun
@djsocialanxiety1664
@djsocialanxiety1664 3 ай бұрын
thanks man
@khabibownsmysoul7836
@khabibownsmysoul7836 Ай бұрын
May be wrong I am not an expert but isn’t the Bellman equation supposed to add the reward of the S1 not S2?
@MrHorse16
@MrHorse16 6 ай бұрын
Q*
@friedrichwilhelmhufnagel3577
@friedrichwilhelmhufnagel3577 6 ай бұрын
Instead of saying grid you could say almost say DFA
Reinforcement Learning: on-policy vs off-policy algorithms
14:47
CodeEmporium
Рет қаралды 6 М.
GPT - Explained!
9:11
CodeEmporium
Рет қаралды 40 М.
Watermelon Cat?! 🙀 #cat #cute #kitten
00:56
Stocat
Рет қаралды 21 МЛН
Backstage 🤫 tutorial #elsarca #tiktok
00:13
Elsa Arca
Рет қаралды 33 МЛН
The Worlds Most Powerfull Batteries !
00:48
Woody & Kleiny
Рет қаралды 27 МЛН
Proximal Policy Optimization | ChatGPT uses this
13:26
CodeEmporium
Рет қаралды 11 М.
Reinforcement Learning, by the Book
18:19
Mutual Information
Рет қаралды 75 М.
Bellman Equation -  Explained!
9:05
CodeEmporium
Рет қаралды 14 М.
A Very Simple Transformer Encoder for Protein Classification in PyTorch
14:19
Let's Learn Transformers Together
Рет қаралды 71
Foundations of Q-Learning
16:59
Dr. Daniel Soper
Рет қаралды 31 М.
Building your first Neural Network
15:16
CodeEmporium
Рет қаралды 3,9 М.
Transformer Neural Networks - EXPLAINED! (Attention is all you need)
13:05
NLP with Neural Networks | ngram to LLMs
13:21
CodeEmporium
Рет қаралды 2,3 М.
Watermelon Cat?! 🙀 #cat #cute #kitten
00:56
Stocat
Рет қаралды 21 МЛН