How to Build a Superhuman Poker AI using CFR | Creating a Poker Bot Part 2

Рет қаралды 44,934

Күн бұрын

Пікірлер: 59

@KylieYYing 3 жыл бұрын

What are your thoughts on superhuman AIs?? P.S. Check out Taskade, a super awesome platform for managing collaboration and organizing projects. It's simple to use and free! www.taskade.com Be sure to subscribe for more fun content like this :)

@govinej 3 жыл бұрын

可以

@ygjt76v0----- 3 жыл бұрын

I love you❤️

@nicolozaniolobedankt 2 жыл бұрын

Go solve cancer not poker

@IshaqIbrahim3 2 жыл бұрын

Just discovered this channel. I wish I discovered it a long time ago. lol!

@atgaming5318 2 жыл бұрын

one of the most underrated computer science youtuber

@skeletonlord2126 2 жыл бұрын

This is not a real tutorial, this is an ad for a task management software.

@utuber1752 2 жыл бұрын

mmmmm.... normally I'd agree, but I think it's ok this time.

@thecaptainubi 3 жыл бұрын

i can't describe in mere words how wonderful you are and how much you've impacted my life. i thank you deeply.

@KylieYYing 3 жыл бұрын

🥺🥺🥺 Wow, thank you for watching!!

@thecaptainubi 3 жыл бұрын

@@KylieYYing omg you actually saw and replied to me :) please continue making awesome content.

@LeeSpaziano 6 ай бұрын

Awesome video @KylieYYing. I was searching for some insight on clusters for CFR, and your take was perfect. I think you also need to abstract other aspects of the game state such as total pot, chip stacks, pot odds, and most importantly, player statistics. I think the self-play aspect of it makes it impossible to learn to play against various player types (i.e. TAG, Fish, Nit, etc). Therefore my multiplayer simulation will use two MCCFRs and four expert systems, each modeled an avatar of a player caricature. Let me know if you'd like to collaborate IRL.

@KeithGalli 3 жыл бұрын

Damn that was a fire intro

@zchamdawala 3 жыл бұрын

Was eagerly waiting for this. Thanks for uploading 🔥👍. You rock 😀

@KylieYYing 3 жыл бұрын

Thank you for your support 🥰

@jonesbts6393 3 жыл бұрын

I love your content, please keep making more if you are able!

@Luckypantera27 Жыл бұрын

Wow Kylie, you are so talented! love all your videos!!!

@faheemhussain3176 Жыл бұрын

A very good video. Im on my to learn programming and a bot. I assume you arent short of money, knowing the jazz. But youve presented ideas well. Regret is a nice tabgent in perspective. Something just for weight, apart from logic. Good story telling. Wosh you the best. 🎉

@flightisallright 3 жыл бұрын

You should get an online casino to sponsor you ;)

@misclic2408 Жыл бұрын

13:00 that was heads up = not a tournament format = tournaments nodes probably is 10 in the tenth more never the less all theese r extremely interestin ,,,please keep on

@trtlphnx 3 жыл бұрын

Damn!!! I Love Your Incredibly Cogent Insight... Damn; I even Think I Love You!!!

@rubyshekhar2869 3 жыл бұрын

Same here

@TypicallyThomas Жыл бұрын

I've never seen a more invasively sponsored video. Also I never trust free products that can afford sponsorship deals. If they pay to give it away for free, you're the product

@TeeTeeTeeL 2 жыл бұрын

Can you make a bot that’s parallel to the hand your in in real time ? Something like a real time assistant???

@andredias5061 3 жыл бұрын

That's all the steps used in training right? I'm trying to understand the Sub-game solving part........

@SoundsOfTrance-rg8xo 19 күн бұрын

Okay, but how do I actually build it?

@whatnow1194 3 жыл бұрын

how good is learning from from videos and do code and create my own mess and again while trying to get rid of own mess ending up creating more mess out of it and it works fine but not perfect?

@jetshot6133 3 жыл бұрын

we have a big project with Nvidia and aws to create python application using machine learning

@SynedocheFernbank Жыл бұрын

Great video 🎉

@heyman620 3 жыл бұрын

I really liked the video - but I think we should not consider CFR as reinforcement learning since CFR aims to minimize regret (not to maximize reward). It looks like it's a small difference, but in practice, it's the difference between trying to possibly exploit our opponent (maximize reward) and minimizing exploitability (CFR) - which also means CFR is unable to be adaptive if I get it correctly. RE RL for poker, I think Noam Brown has some new related work. Though I am not an expert for RL or CFRs as I do mostly NLP... What do you think?

@roboltamy Жыл бұрын

Reinforcement learning doesn't have as much to do with the objective as it does with the way that the model is trained towards it. Reinforcement learning is where you train the model by having it attempt whatever task it is meant to do (in this case playing poker), then based on the outcome relative to its objective (whether that is minimizing regret or maximizing some other reward) it is optimized. All the regret is, is a different objective, not an alternative style of training. Sorry to reply to a year-old comment and hopefully my explanation is clear enough.

@heyman620 Жыл бұрын

@@roboltamy Well, I agree with most of what you say, and what I wrote is not accurate, but nowadays I know much more about RL and still, CFR is clearly not an RL algorithm. Anyway, Barto might disagree here (he says RL is much broader) but the standard approach would be to not call CFR RL, if you do it you might as well call MCTS RL and so on, but it's incorrect. Again, looking at counterfactual outcomes to maximize your outcome is not the way MDPs work, in MDP, by definition, you don't search in history. But nice comment (nice to realize I know a little more after 1 year).

@OleguitoSwagbucks Жыл бұрын

Does it work with more than 1 opponents?

@rizwanahmad9921 2 жыл бұрын

When there will be the next part?

@anpan4318 2 жыл бұрын

Hey Kylie I'm down if you want to build this together. I've been reading CPRG Lab's paper since 2014. Plus I can pay for the infra bill.

@stonertrading Жыл бұрын

let's go to vegas and try it out. +)

@luhao4337 2 жыл бұрын

Is this only for two players games tho?

@sebastiansahlman4754 3 жыл бұрын

Hi! I have a brief programming question about Python, and I wanted to ask it on the most recent video so you would see it. I am new to Python and wondering how to separate the conditions of an "if" statement. Here is an example: w = 0 x = 1 y = 2 z = 3 if w == 0 and x == 1 or y == 2 and z == 3: print('Pass') See how this could be interpreted multiple ways? One way is: if (w is 0 and x is 1) or (y is 2 and z is 3) Another way is: if (w is 0) and (x is 1 or y is 2) and (z is 3) for the first one, w = 0 x = 2 y = 2 z = 3 would not pass the test and print "pass", because w is 0, but x is not 1 for the second one, however, w = 0 x = 2 y = 2 z = 3 would pass the test, because (w is 0), and (x is 1 or y is 2) and (z is 3) How do you separate these conditions in Python? Any help is appreciated. Thanks

@yoyo-yx1qx 3 жыл бұрын

Bro you are confusing yourself for no reason just understand how "and' "or" operators work by using "and" like if x ==1 and y==6 print("ho") you are telling the program that it should only print when both are true if one is not true it will not print however if you say ,if x == 2 or y ==1 print("bruh") you are telling the program print bruh if one of these two is true. after you understand how both this operators work you can play with them anyway you want,i prefer using one operator per 1 if statement in my code.depending on the situation i don't like mixing "and" with "or" in one if statement if it is not required, it creates bugs in my programs.i hope that helped you to understand better.

@khytron06 3 жыл бұрын

If you're using 'and' and 'or' in one line of code. it seems to prefer the 'and' first, and here's how Python interpretes your statement: if (w == 0 and x == 1) or (y == 2 and z == 3): print('pass') notice how the 'and' compared to 'or' is like the multiplication/division and additiion/subtraction. w = 0 x = 2 y = 2 z = 3 would pass the test, bcz y == 2 and z == 3.

@seungsooim2183 2 жыл бұрын

this is not interpreted multiple ways. when you code that conditional statement python requires you to bracket the 'and' together and so the or is between the two conjuncts. nothing arbitrary about it.

@Ms123kill 3 жыл бұрын

But the real question is did you make money on online poker. Asking for a friend I might make one but I heard they have been taking down such outlier performers.

@Fine_Mouche 2 жыл бұрын

6:40 : this algo seem very result oriented, but i heard it's bad to be result oriented in poker. So why it's good for the AI ?

@misclic2408 Жыл бұрын

u thinking about it very simple = its exactly the oposite = it calculates all possible expected values

@Luckypantera27 Жыл бұрын

@@misclic2408 Hi- how can I contact u? I have a question that I am sure you will be able to answer 🙏🏻 Thanks

@Junmarvi 3 жыл бұрын

New Follower here..:). Keep it up..

@urbleeding Жыл бұрын

you are super smart

@MsYann34 3 жыл бұрын

nice video

@meMe-qi1tp 3 жыл бұрын

Bro this could have been the biggest brain video if you decided for the AI to take into account other people’s regrets and use that against them. I have not heard a poker AI developer said they developed their AI that way (or at least they didn’t reveal that)

@KylieYYing 3 жыл бұрын

Actually, that is totally possible! Basically in poker, there's two streets of strategies - game theory optimal (GTO) and exploitative. GTO tried to play at Nash equilibrium (like this CFR bot) because in theory it's unbeatable. However, how close you can get to Nash equilibrium depends on your state/action abstractions and how long you train the bot for. Exploitative play though does exactly what you mentioned. You try to find your opponents' weaknesses (such as their regret) and use that against them. However, in theory, this will lose to a bot playing at Nash equilibrium (since that's what's considered optimal, and a bot playing at Nash equilibrium does not lose in the long run).

@meMe-qi1tp 3 жыл бұрын

@@KylieYYing queen sht but I’m talking about the boy playing the pros only so exploitive might still work idk? Would be fun to see pro poker players like my man Daniel got trolled tho

@guilhermegdeavogtmannsberg3776 2 жыл бұрын

@@KylieYYing there are more conditions that the bot work,not only GTO,Nash etc, Pluribus $350 usd, can do millions, but not at many poker rooms, because they ban