Bellman Equation - Explained!

  Рет қаралды 15,421

CodeEmporium

CodeEmporium

8 ай бұрын

Let's talk about the most consequential equation in reinforcement learning: The bellman equation.
ABOUT ME
⭕ Subscribe: kzbin.info...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajhalthor
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
[2] Paradigms of ML: idapgroup.com/blog/types-of-m...
[3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow

Пікірлер: 13
@gauravshinde8767
@gauravshinde8767 6 ай бұрын
KZbin algo, please make the relevance score of this video to 10/10. This video is too good to be ignored
@CodeEmporium
@CodeEmporium 6 ай бұрын
Thank you! Now if only the KZbin gods listen
@jsp991204
@jsp991204 4 ай бұрын
Thanks alot!!😀
@vanilan3585
@vanilan3585 8 ай бұрын
you just make video. what am i about to study😃
@slitihela1860
@slitihela1860 4 ай бұрын
can you prepare a video for Double Q-Learning Network and Dueling Double Q-Learning Network please
@rinibhasin17
@rinibhasin17 2 ай бұрын
Confused :(
@bhaveshachhada7242
@bhaveshachhada7242 4 ай бұрын
I was confused. You made me more confused. This doesn't explain the intuition.
@amiralioghli8622
@amiralioghli8622 8 ай бұрын
Hi sir, Please turn your series direction on implementing Transformers on Time Series data Please we are waiting.
@vasarmilan
@vasarmilan 8 ай бұрын
I never heard anyone using Transformers for time series, doesn't sound to be a good idea
@amiralioghli8622
@amiralioghli8622 8 ай бұрын
@vasarmilan Hi, sir. There has been a lot of research done on implementing transformers in time series. Please do a search on Google, please. However, there are no videos available on KZbin for a step-by-step guide on transformers in time series, only for educational purposes. If someone creates a playlist and uploads a video, it will be the first one on the entire KZbin platform as well as solve a lot of students problem like me.
@vasarmilan
@vasarmilan 8 ай бұрын
​@@amiralioghli8622 I did a Google search now, I see in the last 1-2 years there has been an increased research interest. However, all the papers I see are very much "primer"s that ask the question if there will ever be truly efficient timeseries transformers. While I can see the value in some specific cases, like ones similar to speech (very high dimensionality and discrete, relatively low numbered timesteps), for "textbook" timeseries problems (eg. when you have a single or low numbered timeseries with many steps), there is really no point in trying to apply Transformers, as they are really meant to work with high dimensions. And I never encountered a practical situations so far when a (numerical) timeseries was like that.
@CodeEmporium
@CodeEmporium 8 ай бұрын
While I have mentioned in the past that transformers can be used for time series data, it isn’t standard practice. So if you are blocked on a project, I would recommend looking at either traditional methods (like ARIMA) or Machine Learning methods (like building a regressor) for this. I have a video couple of videos on “Time series forecasting with machine learning” that you can look up. Hope this helps for now :)
@amiralioghli8622
@amiralioghli8622 8 ай бұрын
@@CodeEmporium thank you sir from your replying I did that 🙏
Foundation of Q-learning | Temporal Difference Learning explained!
10:11
PINK STEERING STEERING CAR
00:31
Levsob
Рет қаралды 21 МЛН
A pack of chips with a surprise 🤣😍❤️ #demariki
00:14
Demariki
Рет қаралды 42 МЛН
터키아이스크림🇹🇷🍦Turkish ice cream #funny #shorts
00:26
Byungari 병아리언니
Рет қаралды 19 МЛН
Bellman Equation Basics for Reinforcement Learning
13:50
Skowster the Geek
Рет қаралды 141 М.
Q Learning simply explained | SARSA and Q-Learning Explanation
9:46
Q-learning - Explained!
11:54
CodeEmporium
Рет қаралды 14 М.
Markov Decision Processes - Computerphile
17:42
Computerphile
Рет қаралды 159 М.
Multi Armed Bandits - Reinforcement Learning Explained!
10:33
CodeEmporium
Рет қаралды 7 М.
Policy and Value Iteration
16:39
CIS 522 - Deep Learning
Рет қаралды 132 М.
PINK STEERING STEERING CAR
00:31
Levsob
Рет қаралды 21 МЛН