Yoooo I've been sharing your GCN videos with friends and colleagues for a while now, all of a sudden you drop 3 videos at once? Hell yeah
@projectpiano5231 5 months ago
Yooo I just started my first RL project a few days ago after putting off learning about policy gradients and Q-learning for a long time. This could not have been better timing xD I'm implementing PUCT and I'm mostly done, which means I may want to switch to a library now that I've almost written it from scratch. Also thanks for the video! I watched the brain-computer interface video too and it was really interesting. I wonder if they use predictive coding and other theories to inform how they model brain activity. To me it sounds like the brain tries to minimize surprise/loss locally in the neocortex (pretty much the pattern-recognition part of the brain) and does reinforcement learning in the striatum and ventral tegmental area (the reward center of the brain), but as far as I'm aware there isn't any theory that captures all of that and is biologically plausible.
@welcomeaioverlords 5 months ago
Good luck on the project and glad to hear you’re finding something of interest here!
@Acceleratedpayloads 5 months ago
Will you consider doing a Prisoner's Dilemma experiment using an LLM?
@welcomeaioverlords 5 months ago
I think there's already work done on this: arxiv.org/abs/2305.07970; arxiv.org/abs/2305.16867. Once I get simple things to work, I'll move to more advanced agents. I'm ultimately interested in applying evolutionary algorithms to populations of agents (which means they have to be small enough for me to handle) and having them solve tasks that require coordination. Prisoner's Dilemma might help benchmark cooperative behavior?
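As a rough illustration of how Prisoner's Dilemma could work as a cooperation benchmark, here's a minimal Python sketch of an iterated match. The payoff values (5/3/1/0), the two hand-written strategies, and the `play_match` helper are all assumptions made up for this example; they are not taken from the linked papers or from any particular library.

```python
from typing import Callable, List, Tuple

COOPERATE, DEFECT = "C", "D"

# Assumed payoffs (temptation 5 > reward 3 > punishment 1 > sucker 0).
# Keyed by (move_a, move_b) -> (payoff_a, payoff_b).
PAYOFFS = {
    (COOPERATE, COOPERATE): (3, 3),
    (COOPERATE, DEFECT): (0, 5),
    (DEFECT, COOPERATE): (5, 0),
    (DEFECT, DEFECT): (1, 1),
}

# A strategy sees its own history and the opponent's history, returns a move.
Strategy = Callable[[List[str], List[str]], str]


def tit_for_tat(own: List[str], other: List[str]) -> str:
    """Cooperate first, then copy the opponent's previous move."""
    return other[-1] if other else COOPERATE


def always_defect(own: List[str], other: List[str]) -> str:
    """Defect on every round."""
    return DEFECT


def play_match(a: Strategy, b: Strategy, rounds: int = 10) -> Tuple[int, int, float]:
    """Play an iterated match.

    Returns (score_a, score_b, fraction of rounds with mutual cooperation).
    """
    hist_a: List[str] = []
    hist_b: List[str] = []
    score_a = score_b = 0
    mutual = 0
    for _ in range(rounds):
        move_a = a(hist_a, hist_b)
        move_b = b(hist_b, hist_a)
        pay_a, pay_b = PAYOFFS[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        if move_a == COOPERATE and move_b == COOPERATE:
            mutual += 1
        hist_a.append(move_a)
        hist_b.append(move_b)
    return score_a, score_b, mutual / rounds


if __name__ == "__main__":
    print(play_match(tit_for_tat, tit_for_tat))
    print(play_match(tit_for_tat, always_defect))
```

The mutual-cooperation fraction is one possible number to track across a population; a learned or evolved agent could be dropped in anywhere a `Strategy` function is expected.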