AI Plays Space Invaders. Which machine learning algorithm will learn to play first?

Рет қаралды 26,256

3 жыл бұрын

Reinforcement learning is used to train bots using two different algorithms. Playing a simple space shooter-style game.
Custom gym to use with OpenAI algorithms. Showing how you can create more test environments for your custom algorithms. Once you get your environment set up in the OpenAI Gym format it is super easy to switch between different test algorithms.
github.com/ClarityCoders/Spac...
Want to chat with me and other programmers join our discord!
/ discord

Пікірлер: 31

@daddyofalltrades 2 жыл бұрын

Please do more machine learning projects and tutorials. Your explanations are so easy to follow !! Tysm

@ClarityCoders 2 жыл бұрын

Thanks your comments really mean a lot. Feel free to jump in our discord if you like chatting about programming and such!

@Timmysthirdbirthday 2 жыл бұрын

cool and awesome vidio very cool interesting. your transition noise is intrusive, everything else ROCKS keep it up thx

@ClarityCoders 2 жыл бұрын

Awesome thanks for the feedback! I agree not sure what I was doing with the noise I've improved on my newer videos haha.

@levipack3835 3 жыл бұрын

Definitely some of the most concise and logically thought out videos I've seen on programming in Python. Unfortunately it seems like the growth rate of KZbin videos channels are more linear than exponential but you'll get there. Just keep making videos and getting people to interact with them via comments thumbs up subscription etc sharing them what have you.

@ClarityCoders 3 жыл бұрын

Appreciate the comment and the views! I don't over think it to much it's a hobby for me so I just keep trying to make valuable videos for my viewers. If a few people keep watching I'll keep making them!

@okay730 9 ай бұрын

how did you calcuate loss for the DQN? Im aware that MSE has flaws such as minimizing the loss for all of the neural network's output nodes, and is sensitive to outliers.

@MrVersion21 2 жыл бұрын

From my experience of my phd in robotics i would try the following: give an agent-centered image to the network. Therefore the complexity of the image recognition task is greatly reduced, since most movements onky rely on relative information. If you give the image in absolute coordinates the policy has to learn each avtion for each position in the image individually. More thoughts (I did not check the code) - Color code ship, bullet, enimy and wall - Use image augmentation, mostly random cropping - Initialize policy with human play.

@ClarityCoders 2 жыл бұрын

Centered meaning the ship is always in the center?

@MrVersion21 2 жыл бұрын

@@ClarityCoders yes. Ship in the center. Therefore the position of the ship in the image is always constant. If you need more info, please let me know.

@marcorosano9384 2 жыл бұрын

Also, train it with curriculum learning may help (start with simple episodes, then increase the difficulty)

@gusinthecloud 2 жыл бұрын

Great Great Videos. One of the best educational Videos

@ClarityCoders 2 жыл бұрын

Thanks means a lot. I really enjoyed this video actually!

@sholomschonbuch5946 3 жыл бұрын

Awesome video!

@ClarityCoders 3 жыл бұрын

Thanks! Appreciate it!

@kevin5k2008 3 жыл бұрын

Great content and a timely one for beginners like me!!! I have 2 questions arising from this demo: 1) During training of DQN/PPO, are you aware of any methods such that pygame ONLY renders the GUI when we call it? Like during model evaluation only? 2) Can you help to elaborate more as to why 18 actions are needed for the player's spaceship? Here, I am assuming 4 Actions be sufficient - (Left, Right, Up, Down)? Thanks in advance.

@ClarityCoders 3 жыл бұрын

display.update() renders the gui! action space is bigger than 4 because I accounted for doing two things in one turn. for example moving up is and action and moving up while shooting is another. join discord if you would like to chat about this more! Thanks for watching.